We reconsider the existing kernel estimators for a copula function, as
proposed in Gijbels and Mielniczuk [Comm. Statist. Theory Methods 19
(1990) 445-464], Fermanian, Radulovic and Wegkamp [Bernoulli 10 (2004)
847-860] and Chen and Huang [Canad. J. Statist. 35 (2007) 265-282]. All
of these estimators share the drawback that they can suffer from a corner
bias problem. One way to deal with this is to impose rather stringent
conditions on the copula, which rules out many classical families of
copulas. In this paper, we propose improved estimators that take care of
the typical corner bias problem. For the estimators of Gijbels and
Mielniczuk (1990) and Chen and Huang (2007), the improvement involves
shrinking the bandwidth with an appropriate functional factor; for the
estimator of Fermanian, Radulovic and Wegkamp (2004), this is done by
using a transformation. The theoretical contribution of the paper is a
weak convergence result for the three improved estimators under
conditions that are met for most copula families. We also discuss the
choice of bandwidth parameters, theoretically and practically, and
illustrate the finite-sample behaviour of the estimators in a simulation
study. The improved estimators are applied to goodness-of-fit testing for
copulas.

1. Introduction. Consider a random vector $X = (X_1, \dots, X_d)^T$ with
joint cumulative distribution function $H$ and marginal distribution
functions $F_1, \dots, F_d$. According to Sklar's theorem [see, e.g.,
Nelsen (2006)], there exists a $d$-variate function $C$ such that

(1)  $H(x_1, \dots, x_d) = C(F_1(x_1), \dots, F_d(x_d)).$

The function $C$ is called a copula, and it is, in itself, a joint
cumulative distribution function on $[0,1]^d$ with uniform marginals. If
the marginal distribution functions $F_1, \dots, F_d$ are continuous, then
the function $C$ is unique and
$C(u_1, \dots, u_d) = H(F_1^{-1}(u_1), \dots, F_d^{-1}(u_d))$, where, for
$j = 1, \dots, d$, $F_j^{-1}(u) = \inf\{x : F_j(x) \ge u\}$, with
$u \in [0,1]$, is the quantile function of $F_j$. The copula $C$ "couples"
the joint distribution function $H$ to its univariate marginals, thereby
capturing the dependence structure between the components of
$X = (X_1, \dots, X_d)^T$.

Methods for estimation of copulas usually depend on how much we are 
willing to assume about the joint distribution function H. In fully para- 
metric approaches with parametric models for both the copula and the 
marginals, maximum likelihood estimation may be used. Nowadays semi- 
parametric estimation is quite popular, in which one specifies a parametric 
copula and estimates the marginals nonparametrically. In this paper, we fo- 
cus on nonparametric estimation of the copula making as such no restrictive 
distributional assumptions on the copula nor on the marginals. 

For simplicity of the presentation, we will restrict to the case $d = 2$,
and consider an independent and identically distributed sample
$(X_1, Y_1)^T, \dots, (X_n, Y_n)^T$ of a bivariate random vector
$(X, Y)^T$ with joint distribution function $H$ and marginal distribution
functions $F$ and $G$.

Nonparametric estimation of copulas goes back to Deheuvels (1979), who
proposed, in order to test for independence, the following empirical
copula estimator:

$C_n(u,v) = \frac{1}{n} \sum_{i=1}^{n} 1\{\hat U_i \le u, \hat V_i \le v\}$
with $\hat U_i = F_n(X_i)$, $\hat V_i = G_n(Y_i)$,

where $F_n$ and $G_n$ are the empirical cumulative distribution functions
of the marginals, and where $1\{A\}$ denotes the indicator of a set $A$.
This estimator is asymptotically equivalent [up to a term $O(n^{-1})$] to
the estimator based directly on Sklar's theorem given by

(2)  $C_n(u,v) = H_n(F_n^{-1}(u), G_n^{-1}(v))$

with $H_n$ the empirical joint distribution function. Weak convergence
studies of this estimator can be found in Gänssler and Stute (1987),
Fermanian, Radulovic and Wegkamp (2004) and Tsukahara (2005). Our Monte
Carlo experiments showed that it is better to use the following
(asymptotically equivalent) modification of the empirical copula:

(3)  $C_n^{(E)}(u,v) = \frac{1}{n} \sum_{i=1}^{n} 1\{\hat U_i^E \le u, \hat V_i^E \le v\}$
with $\hat U_i^E = \frac{n}{n+1} F_n(X_i)$, $\hat V_i^E = \frac{n}{n+1} G_n(Y_i)$,

which shifts the pseudo-observations $F_n(X_i)$ and $G_n(Y_i)$ a bit closer
to the left endpoint of the unit interval $[0,1]$ [see, e.g., Genest,
Ghoudi and Rivest (1995)].
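
The construction in (3) can be sketched in a few lines of Python. This is
only an illustration under our own assumptions: the function names
`pseudo_observations` and `empirical_copula` and the toy data are
hypothetical and do not come from the paper, and ties are handled simply
by counting observations less than or equal to each value.

```python
from bisect import bisect_right


def pseudo_observations(sample):
    """Return n/(n+1) * F_n(x_i) for each x_i, with F_n the empirical CDF."""
    n = len(sample)
    ordered = sorted(sample)
    # bisect_right counts the observations <= x_i, i.e. n * F_n(x_i).
    return [bisect_right(ordered, x) / (n + 1) for x in sample]


def empirical_copula(u_obs, v_obs, u, v):
    """Fraction of pseudo-observations falling in [0, u] x [0, v]."""
    hits = sum(1 for a, b in zip(u_obs, v_obs) if a <= u and b <= v)
    return hits / len(u_obs)


# Toy data, made up for illustration only.
x = [0.3, 1.2, -0.5, 2.0]
y = [1.0, 3.5, 0.2, 2.7]
u_obs = pseudo_observations(x)
v_obs = pseudo_observations(y)
value = empirical_copula(u_obs, v_obs, 0.5, 0.5)
```

Because of the factor $n/(n+1)$, all pseudo-observations lie strictly
inside $(0,1)$, which is convenient when they are later passed through a
quantile function.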

Fermanian, Radulovic and Wegkamp (2004) also proposed a smoothed version
of the empirical copula. Their proposal is a straightforward modification
of (3) and the estimator is defined as

(4)  $\hat C_n^{(SE)}(u,v) = \hat H_n(\hat F_n^{-1}(u), \hat G_n^{-1}(v)),$

where the quantities $\hat H_n$, $\hat F_n$ and $\hat G_n$ are given by

(5)  $\hat H_n(x,y) = \frac{1}{n} \sum_{i=1}^{n} K_n(x - X_i, y - Y_i),$
     $\hat F_n(x) = \hat H_n(x, +\infty),$
     $\hat G_n(y) = \hat H_n(+\infty, y),$

with

$K_n(x,y) = K\left(\frac{x}{b_n}, \frac{y}{b_n}\right)$,
$K(x,y) = \int_{-\infty}^{x} \int_{-\infty}^{y} k(s,t) \, ds \, dt,$

where $k(s,t)$ is a given bivariate kernel density function, and $b_n$ is
a bandwidth sequence tending to zero with $n$. Fermanian, Radulovic and
Wegkamp (2004) proved weak convergence of this estimator.

There are two kernel type estimators in the literature that pay special
attention to the correction of the boundary bias. This typical bias
associated with kernel estimation is present since a copula has its
support on the bounded set $[0,1]^2$. The first reference is the
mirror-reflection type estimator originating from the work of Gijbels and
Mielniczuk (1990) on copula density estimation. They take care of boundary
bias correction through data augmentation obtained by reflecting the
original data with respect to the edges and the corners of the unit
square. The second reference is the estimator of Chen and Huang (2007),
who proposed to use a local linear kernel in order to deal with the bias
near the boundaries of the unit square.

A first goal of the present paper is to prove the weak convergence of the
estimators of Gijbels and Mielniczuk (1990) and Chen and Huang (2007)
under the assumption that $C$ has bounded second order partial derivatives
on $[0,1]^2$ (see Theorem 1 in Section 2). It turns out, however, that
for many commonly used families of copulas (e.g., Clayton, Gumbel, normal,
Student), the latter condition is not satisfied and the bias behavior at
the corners of the unit square precludes the weak convergence on the whole
$[0,1]^2$. We therefore propose improved "shrinked" versions of the
estimators of Gijbels and Mielniczuk (1990) and Chen and Huang (2007).
This shrinking is done by including a weight function which removes the
corner bias. In the same spirit, we also suggest a modification of the
copula estimator (4) of Fermanian, Radulovic and Wegkamp (2004). In
Theorem 2 we establish weak convergence for all newly proposed estimators.
The finite-sample performance of the estimators is demonstrated via a
simulation study. We discuss optimal bandwidth selection and compare the
performances of the estimators using various well-known distance measures.

The second goal of the paper is to discuss the use of the various estimators 
of copulas in goodness-of-fit testing problems. 

The paper is organized as follows. In Section 2, we introduce the im- 
proved kernel estimators and state the main theoretical results on weak 
convergence. In Section 3, we investigate the finite-sample performance of 
the newly-proposed estimators and compare these with performances of ex- 
isting estimators. In Section 4, simulation results are reported for goodness- 
of-fit testing. The proofs of the weak convergence results are given in the 
Appendix. 



2. Nonparametric kernel estimators of a copula. In this section, we
briefly discuss existing kernel estimators and propose important
modifications. We also state the weak convergence results.



2.1. Local linear kernel estimator. Chen and Huang (2007) constructed
their estimator in the following way. In the first stage, they estimate
the marginals by

(6)  $\hat F_n(x) = \frac{1}{n} \sum_{i=1}^{n} K\left(\frac{x - X_i}{b_{1n}}\right)$,
     $\hat G_n(y) = \frac{1}{n} \sum_{i=1}^{n} K\left(\frac{y - Y_i}{b_{2n}}\right)$,

with $K$ the integral of a symmetric bounded kernel function $k$ supported
on $[-1,1]$. In the second stage, the pseudo-observations
$\hat U_i = \hat F_n(X_i)$ and $\hat V_i = \hat G_n(Y_i)$ are used to
estimate the joint distribution function of the unobserved $F(X_i)$ and
$G(Y_i)$, which gives the estimate of the unknown copula $C$. To prevent
boundary bias, Chen and Huang (2007) suggested using a local linear
version of the kernel $k$ given by

(7)  $k_{u,h}(x) = \frac{k(x)\{a_2(u,h) - a_1(u,h) x\}}{a_0(u,h) a_2(u,h) - a_1^2(u,h)} \, 1\left\{\frac{u-1}{h} < x < \frac{u}{h}\right\},$

where

$a_\ell(u,h) = \int_{(u-1)/h}^{u/h} t^\ell k(t) \, dt$ for $\ell = 0, 1, 2$.

Finally, the local linear type estimator of the copula is given by

(8)  $\hat C_n^{(LL)}(u,v) = \frac{1}{n} \sum_{i=1}^{n} K_{u,h_n}\left(\frac{u - \hat U_i}{h_n}\right) K_{v,h_n}\left(\frac{v - \hat V_i}{h_n}\right),$

where $K_{u,h}(x) = \int_{-\infty}^{x} k_{u,h}(s) \, ds$. Chen and Huang
(2007) derived expressions for asymptotic bias, variance and mean squared
error for this estimator and showed that a proper choice of the second
stage smoothing constant $h = h_n$ may considerably decrease variance and
mean squared error of the copula estimate. Moreover, their Monte Carlo
experiments showed that the estimator $\hat C_n^{(LL)}$ is quite
insensitive to the choice of the constants $b_{1n}$ and $b_{2n}$ used for
smoothing the marginals in the first stage. Variance considerations
provided by the authors even showed that it is reasonable to take $b_{1n}$
and $b_{2n}$ as small as possible. Note that strong undersmoothing in the
first stage, recommended in Chen and Huang (2007), results in using the
pseudo-observations
$(\hat U_i, \hat V_i)^T = \left(\frac{2nF_n(X_i) - 1}{2n}, \frac{2nG_n(Y_i) - 1}{2n}\right)^T$,
which are asymptotically equivalent to the most commonly used
pseudo-observations defined in (3).
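
To make (7) and (8) concrete, the following is a minimal Python sketch
under our own assumptions: it fixes $k$ to be the Epanechnikov kernel
$k(t) = \frac{3}{4}(1 - t^2)$ on $[-1,1]$ (the kernel used later in the
simulation study), so that the moments $a_\ell(u,h)$ and the integrated
kernel $K_{u,h}$ have closed forms. The function names are hypothetical
and not taken from Chen and Huang (2007).

```python
def _epan_moment(t, power):
    """Antiderivative of t**power * k(t) for k(t) = 0.75 * (1 - t**2)."""
    p = power + 1
    return 0.75 * (t ** p / p - t ** (p + 2) / (p + 2))


def local_linear_cdf(x, u, h):
    """K_{u,h}(x): integral up to x of the local linear kernel k_{u,h}."""
    # Support of k_{u,h}: kernel support [-1, 1] cut at (u - 1)/h and u/h.
    lo = max(-1.0, (u - 1.0) / h)
    hi = min(1.0, u / h)
    a0, a1, a2 = (_epan_moment(hi, l) - _epan_moment(lo, l) for l in range(3))
    x = min(max(x, lo), hi)  # K_{u,h} is constant outside [lo, hi]
    part0 = _epan_moment(x, 0) - _epan_moment(lo, 0)
    part1 = _epan_moment(x, 1) - _epan_moment(lo, 1)
    return (a2 * part0 - a1 * part1) / (a0 * a2 - a1 ** 2)


def copula_ll(u_obs, v_obs, u, v, h):
    """Local linear copula estimate at (u, v) with bandwidth h, as in (8)."""
    total = 0.0
    for a, b in zip(u_obs, v_obs):
        total += (local_linear_cdf((u - a) / h, u, h)
                  * local_linear_cdf((v - b) / h, v, h))
    return total / len(u_obs)
```

Away from the boundary, that is for $h \le u \le 1 - h$, the moment $a_1$
vanishes and `local_linear_cdf` reduces to the ordinary integrated
Epanechnikov kernel; the correction only acts within distance $h$ of 0 or 1.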

As already mentioned in the Introduction, the theoretical inconvenience of
the estimator (8) is that for many common families of copulas (e.g.,
Clayton, Gumbel, normal, Student) the bias of the estimator at some of the
corners of the unit square is only of order $O(h_n)$. As the optimal
bandwidth for distribution function estimation is of order $O(n^{-1/3})$,
this violates the $n^{1/2}$-order weak convergence on the whole $[0,1]^2$.

The problem is caused by unboundedness of second order partial deriva- 
tives of many copula families. Although parametric models with unbounded 
densities are rather rare in "standard" parametric models, copula families 
with unbounded densities are quite common. As a benchmark, we can take 
the normal bivariate density, which is usually supposed to be a well-behaved 
model. But the resulting normal copula density is unbounded. 

To overcome this difficulty, we propose a method of shrinking the
bandwidth when coming close to the borders of the unit square. The
proposed method is based on the observation that, when calculating the
bias of the estimator (8), we have to deal with terms of the form
$h_n^2 C_{uu}(u,v)$, $h_n^2 C_{uv}(u,v)$ and $h_n^2 C_{vv}(u,v)$, where
$C_{uu}(u,v)$, $C_{uv}(u,v)$, $C_{vv}(u,v)$ are the second order partial
derivatives of $C$; that is, $C_{uu}(u,v) = \partial^2 C(u,v)/\partial u^2$
and, similarly, for $C_{uv}(u,v)$, $C_{vv}(u,v)$. The problem is that, for
many common families of copulas, these second order partial derivatives
are not bounded, and, in fact, a closer inspection of them shows that

(9)  $C_{uu}(u,v) = O\left(\frac{1}{u(1-u)}\right)$,
     $C_{vv}(u,v) = O\left(\frac{1}{v(1-v)}\right)$,
     $C_{uv}(u,v) = O\left(\frac{1}{\sqrt{uv(1-u)(1-v)}}\right)$.

This is shown in Appendix D for Clayton, Gumbel, normal and Student
copulas. In order to keep the bias bounded, we suggest an improved
"shrinked" version of (8), which is given by

(10)  $\hat C_n^{(LLS)}(u,v) = \frac{1}{n} \sum_{i=1}^{n} K_{u,b(u)h_n}\left(\frac{u - \hat U_i}{b(u)h_n}\right) K_{v,b(v)h_n}\left(\frac{v - \hat V_i}{b(v)h_n}\right)$
      with $b(w) = \min\{\sqrt{w}, \sqrt{1-w}\}$,

where the constant bandwidth $h_n$ is replaced by a bandwidth function
$b(u)h_n$ that "shrinks" the value of the bandwidth close to zero at the
corners of the unit square. A straightforward adaptation of the result of
Chen and Huang (2007) gives that, for
$(u/b(u), v/b(v)) \in [h_n, 1 - h_n]^2$ (and no smoothing of the marginals
in the first stage),

(11)  $\mathrm{Bias}\{\hat C_n^{(LLS)}(u,v)\} = \frac{\sigma_K^2}{2} h_n^2 \{b^2(u) C_{uu}(u,v) + b^2(v) C_{vv}(u,v)\} + o(h_n^2),$

(12)  $\mathrm{Var}\{\hat C_n^{(LLS)}(u,v)\} = \frac{1}{n} \mathrm{Var}[1\{U \le u, V \le v\} - C_u(u,v) 1\{U \le u\} - C_v(u,v) 1\{V \le v\}]$
      $\quad - \frac{h_n b_K}{n} [b(u) C_u(u,v)(1 - C_u(u,v)) + b(v) C_v(u,v)(1 - C_v(u,v))] + o\left(\frac{h_n}{n}\right)$

with $\sigma_K^2 = \int_{-1}^{1} t^2 k(t) \, dt$,
$b_K = 2 \int_{-1}^{1} t k(t) K(t) \, dt$ and $b(\cdot)$ as defined in (10).

Taking $b(w) = 1$ gives back the bias and variance expressions for
$\hat C_n^{(LL)}$ in Chen and Huang (2007) (in case of no smoothing at the
first stage).

The improvements are obtained by shrinking the bandwidth through the
function $b(\alpha, w) = \min\{w^\alpha, (1-w)^\alpha\}$. Different choices
of $\alpha$ or different choices of shrinking factors are possible, but our
extensive investigations showed that
$b(w) = \min\{\sqrt{w}, \sqrt{1-w}\}$ is overall a very good choice. The
choice of a possible optimal shrinking factor is an open question.
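
The reason why the square-root shrinking factor tames the leading bias
term can be checked directly. In the short derivation below, $M$ is a
hypothetical constant (our notation, not the paper's) such that
$|C_{uu}(u,v)| \le M/\{u(1-u)\}$, which is the kind of bound that (9)
provides; the same argument applies to the $C_{vv}$ term.

```latex
% Assumption: |C_{uu}(u,v)| <= M / (u(1-u)) for some constant M, cf. (9).
% Uses u(1-u) = \min\{u,1-u\} \cdot \max\{u,1-u\} and \max\{u,1-u\} \ge 1/2.
b^2(u)\,\lvert C_{uu}(u,v)\rvert
  \;\le\; \min\{u,\,1-u\}\cdot\frac{M}{u(1-u)}
  \;=\; \frac{M}{\max\{u,\,1-u\}}
  \;\le\; 2M , \qquad 0 < u < 1 .
```

Hence the terms $h_n^2 b^2(u) C_{uu}(u,v)$ and $h_n^2 b^2(v) C_{vv}(u,v)$
appearing in the bias expansion are of order $O(h_n^2)$ uniformly in
$(u,v)$, even though $C_{uu}$ and $C_{vv}$ themselves are unbounded near
the border of the unit square.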
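
2.2. Mirror-reflection kernel estimator. Another version of a kernel
estimator for the copula might be obtained by integration of the estimator
of the density of the copula introduced and studied in Gijbels and
Mielniczuk (1990). This estimator deals with the boundary problem by the
technique known as mirror-reflection. If a multiplicative kernel
$k(x,y) = k(x)k(y)$ is used, then the mirror-reflection estimate of the
copula has a simple form

(13)  $\hat C_n^{(MR)}(u,v) = \frac{1}{n} \sum_{i=1}^{n} \sum_{\ell=1}^{9} \left[K\left(\frac{u - \hat U_i^{(\ell)}}{h_n}\right) - K\left(\frac{-\hat U_i^{(\ell)}}{h_n}\right)\right] \left[K\left(\frac{v - \hat V_i^{(\ell)}}{h_n}\right) - K\left(\frac{-\hat V_i^{(\ell)}}{h_n}\right)\right],$

where
$\{(\hat U_i^{(\ell)}, \hat V_i^{(\ell)}), i = 1, \dots, n, \ell = 1, \dots, 9\} = \{(\pm \hat U_i, \pm \hat V_i), (\pm \hat U_i, 2 - \hat V_i), (2 - \hat U_i, \pm \hat V_i), (2 - \hat U_i, 2 - \hat V_i), i = 1, \dots, n\}$.

The mirror-type estimator (13) faces the same "corner bias" problem as the
local linear estimator (8). To prevent this problem, we can "shrink" the
bandwidth similarly as in (10) and propose

(14)  $\hat C_n^{(MRS)}(u,v) = \frac{1}{n} \sum_{i=1}^{n} \sum_{\ell=1}^{9} \left[K\left(\frac{u - \hat U_i^{(\ell)}}{b(u)h_n}\right) - K\left(\frac{-\hat U_i^{(\ell)}}{b(u)h_n}\right)\right] \left[K\left(\frac{v - \hat V_i^{(\ell)}}{b(v)h_n}\right) - K\left(\frac{-\hat V_i^{(\ell)}}{b(v)h_n}\right)\right].$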
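
A compact Python sketch of the mirror-reflection estimator and its
shrunken-bandwidth variant follows. It is an illustration under our own
assumptions: $K$ is taken to be the integrated Epanechnikov kernel, the
function names are hypothetical, and a tiny positive floor replaces a zero
bandwidth at $u \in \{0, 1\}$ when shrinking is switched on. Since the nine
reflected points form the product set
$\{\hat U_i, -\hat U_i, 2 - \hat U_i\} \times \{\hat V_i, -\hat V_i, 2 - \hat V_i\}$,
the inner sum over $\ell$ factorizes into a product of two sums over three
reflections each, which is what the code uses.

```python
def epan_cdf(x):
    """Integrated Epanechnikov kernel K(x) = int_{-1}^{x} 0.75 * (1 - t**2) dt."""
    if x <= -1.0:
        return 0.0
    if x >= 1.0:
        return 1.0
    return 0.5 + 0.75 * (x - x ** 3 / 3.0)


def shrink(w):
    """Shrinking factor b(w) = min(sqrt(w), sqrt(1 - w))."""
    return min(w, 1.0 - w) ** 0.5


def _reflected_mass(point, obs, h):
    """Sum over r in {obs, -obs, 2 - obs} of K((point - r)/h) - K(-r/h)."""
    return sum(epan_cdf((point - r) / h) - epan_cdf(-r / h)
               for r in (obs, -obs, 2.0 - obs))


def copula_mr(u_obs, v_obs, u, v, h, shrunk=False):
    """Mirror-reflection estimate; shrunk=True uses bandwidths b(u)h and b(v)h."""
    tiny = 1e-12  # assumption: floor that avoids division by zero at the border
    hu = max(h * shrink(u), tiny) if shrunk else h
    hv = max(h * shrink(v), tiny) if shrunk else h
    total = 0.0
    for a, b in zip(u_obs, v_obs):
        total += _reflected_mass(u, a, hu) * _reflected_mass(v, b, hv)
    return total / len(u_obs)
```

For a symmetric kernel and $h < 1$, each factor equals 0 at the point 0 and
1 at the point 1, so the estimate vanishes on the lower and left edges and
equals 1 at $(1,1)$, as a copula should.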



2.3. Transformation estimator. The unboundedness of the densities of many
copula families brings us back to Sklar's theorem in (1) and to the
estimator (4) proposed in Fermanian, Radulovic and Wegkamp (2004).

To control the bias of this estimator in order to achieve weak
convergence, we need the boundedness of the second order partial
derivatives of the original joint distribution $H$. As the bivariate
normal benchmark example shows, this condition may be considerably weaker
than the requirement of bounded second order derivatives of the
underlying copula $C$.

A possible methodological objection to the estimator $\hat C_n^{(SE)}$,
defined in (4), may be its dependence on the marginal distributions. This
is confirmed by Monte Carlo simulations, which show that, for a given
copula, the success of this estimator depends crucially on the marginals.

As the copula function is invariant to increasing transformations of the
margins, it is possible to transform the original data to
$X_i' = T_1(X_i)$ and $Y_i' = T_2(Y_i)$, where $T_1$ and $T_2$ are
increasing functions, and then use $(X_i', Y_i')$ instead of the original
observations $(X_i, Y_i)$ in the estimator $\hat C_n^{(SE)}$. The aim of
the transformation is to simplify the kernel estimation of the joint
distribution. As the direct choice of functions $T_1$, $T_2$ is difficult,
we propose the following procedure. Let us first construct the uniform
pseudo-observations $\hat U_i^E = \frac{n}{n+1} F_n(X_i)$ and
$\hat V_i^E = \frac{n}{n+1} G_n(Y_i)$. Then, for a given distribution
function $\Phi$, put $S_i = \Phi^{-1}(\hat U_i^E)$ and
$T_i = \Phi^{-1}(\hat V_i^E)$. Finally, use these transformed
pseudo-observations $(S_i, T_i)$ instead of the original observations
$(X_i, Y_i)$ in the estimator (5) of the joint distribution function.
Since the marginals are now known to be given by the function $\Phi$, the
suggested estimator has, in the case of a multiplicative kernel, the
following simple formula:

(15)  $\hat C_n^{(T)}(u,v) = \frac{1}{n} \sum_{i=1}^{n} K\left(\frac{\Phi^{-1}(u) - \Phi^{-1}(\hat U_i^E)}{h_n}\right) K\left(\frac{\Phi^{-1}(v) - \Phi^{-1}(\hat V_i^E)}{h_n}\right).$

The advantage of this estimator is that it is not affected by the marginal
distributions. Further bias calculations show that, if we choose $\Phi$
such that $\Phi'(x)^2/\Phi(x)$ is bounded, we take care of the "corner bias
problem" that is present if we try to estimate the joint distribution of
pseudo-observations directly. The above condition is satisfied, for
example, for $\Phi$ the normal cumulative distribution function.
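
The procedure of this subsection can be sketched as follows, taking $\Phi$
to be the standard normal distribution function and $K$ the integrated
Epanechnikov kernel. Both choices, the helper names and the handling of
the end points $u \in \{0, 1\}$ through infinite quantiles are our own
assumptions for illustration; the pseudo-observations passed in must lie
strictly inside $(0,1)$, as the rescaling by $n/(n+1)$ guarantees.

```python
import math
from statistics import NormalDist

_PHI = NormalDist()  # standard normal: one admissible choice of Phi


def epan_cdf(x):
    """Integrated Epanechnikov kernel K(x) = int_{-1}^{x} 0.75 * (1 - t**2) dt."""
    if x <= -1.0:
        return 0.0
    if x >= 1.0:
        return 1.0
    return 0.5 + 0.75 * (x - x ** 3 / 3.0)


def normal_quantile(p):
    """Phi^{-1}(p), extended by -inf and +inf at p = 0 and p = 1."""
    if p <= 0.0:
        return -math.inf
    if p >= 1.0:
        return math.inf
    return _PHI.inv_cdf(p)


def copula_transform(u_obs, v_obs, u, v, h):
    """Kernel estimate on the Phi^{-1} scale, evaluated at (u, v)."""
    s_point, t_point = normal_quantile(u), normal_quantile(v)
    total = 0.0
    for a, b in zip(u_obs, v_obs):
        # S_i = Phi^{-1}(U_i), T_i = Phi^{-1}(V_i): transformed pseudo-observations
        total += (epan_cdf((s_point - _PHI.inv_cdf(a)) / h)
                  * epan_cdf((t_point - _PHI.inv_cdf(b)) / h))
    return total / len(u_obs)
```

Note that the bandwidth `h` now lives on the transformed (normal) scale,
so it is not directly comparable with the bandwidths used for the
estimators that smooth on the unit square.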



2.4. Main results. The main theoretical contribution of this paper is the
weak convergence of the kernel estimators $\hat C_n^{(LL)}$,
$\hat C_n^{(LLS)}$, $\hat C_n^{(MR)}$, $\hat C_n^{(MRS)}$ and
$\hat C_n^{(T)}$.

For notational convenience, let us denote by $\hat F_n$ and $\hat G_n$ the
estimates of the marginals that are used to construct pseudo-observations;
that is, in the following we will write $\hat U_i = \hat F_n(X_i)$ and
$\hat V_i = \hat G_n(Y_i)$. For the weak convergence results we need these
functions to be asymptotically equivalent to the empirical cumulative
distribution functions $F_n$, $G_n$; that is,

(16)  $\sup_x |\hat F_n(x) - F_n(x)| = o_P\left(\frac{1}{\sqrt{n}}\right)$,
      $\sup_y |\hat G_n(y) - G_n(y)| = o_P\left(\frac{1}{\sqrt{n}}\right)$,

which further implies the standard weak convergence of the processes
$\sqrt{n}(\hat F_n - F)$ and $\sqrt{n}(\hat G_n - G)$ to particular
Brownian bridges. For technical reasons, we will also suppose that the
functions $\hat F_n$ and $\hat G_n$ are nondecreasing, which excludes
higher order kernels (taking negative values) for the estimation of the
marginals.

It is easy to see that (16) is satisfied if we define pseudo-observations
as $\hat U_i = \frac{2nF_n(X_i) - 1}{2n}$,
$\hat V_i = \frac{2nG_n(Y_i) - 1}{2n}$ or in the way given in (3).

If we opt for kernel smoothing of the marginals as given in (6), then it
is well known [see, e.g., Lemma 7 of Fermanian, Radulovic and Wegkamp
(2004)] that assumption (16) is met if there exists $\alpha > 0$ such that,
uniformly in $x$,

$F(x + b_{1n}) = F(x) + b_{1n} f(x) + o(b_{1n}^{1+\alpha})$ with $\sqrt{n} \, b_{1n}^{1+\alpha} \to 0$,

where $f$ denotes the derivative of $F$ and, similarly, for $G$ involving
$b_{2n}$.

Let $\mathbb{C}_n^{(LL)}$, $\mathbb{C}_n^{(LLS)}$, $\mathbb{C}_n^{(MR)}$,
$\mathbb{C}_n^{(MRS)}$, $\mathbb{C}_n^{(T)}$ be suitably normalized
empirical copula processes; that is, for $(u,v) \in [0,1]^2$,

$\mathbb{C}_n^{(\cdot)}(u,v) = \sqrt{n} \{\hat C_n^{(\cdot)}(u,v) - C(u,v)\}.$

The proof of the following theorem is given in Appendix A. The terminology
on stochastic processes (e.g., pinned $C$-Brownian sheet) is taken from
Tsukahara (2005). We refer the reader to this reference for details on the
concepts used.



Theorem 1. Suppose that $H$ has continuous marginal distribution
functions and that the underlying copula function $C$ has bounded second
order partial derivatives on $[0,1]^2$. If $h_n = O(n^{-1/3})$ and (16) is
satisfied, then the (kernel) copula processes $\mathbb{C}_n^{(LL)}$ and
$\mathbb{C}_n^{(MR)}$ converge weakly to the Gaussian process $G_C$ in
$\ell^\infty([0,1]^2)$, having representation

(17)  $G_C(u,v) = B_C(u,v) - C_u(u,v) B_C(u,1) - C_v(u,v) B_C(1,v),$

where $C_u$ and $C_v$ denote the first order partial derivatives of $C$,
and $B_C$ is a two-dimensional pinned $C$-Brownian sheet on $[0,1]^2$; that
is, it is a centered Gaussian process with covariance function

(18)  $E[B_C(u,v) B_C(u',v')] = C(u \wedge u', v \wedge v') - C(u,v) C(u',v').$

While Theorem 1 requires boundedness of the second order partial
derivatives of the copula $C$, the weak convergence result of Fermanian,
Radulovic and Wegkamp (2004) for the estimator $\hat C_n^{(SE)}$ given by
(4) requires boundedness of the second order derivatives of the original
joint distribution function $H$. This may or may not be more stringent,
depending on the marginals. Unfortunately, Theorem 1 excludes many
commonly used families of copulas. The next theorem and Appendix D
guarantee that the weak convergence of the proposed improved estimators
$\hat C_n^{(LLS)}$, $\hat C_n^{(MRS)}$, $\hat C_n^{(T)}$ holds for
commonly used copulas such as Clayton, Gumbel, normal and Student copulas.

Remark. A careful reader may notice that all the published weak
convergence results for the empirical estimator (2) or the smoothed
empirical estimator (4) require smoothness of the first order partial
derivatives $C_u$ and $C_v$ of the copula $C$ on $[0,1]^2$. But this
smoothness assumption usually does not hold for the families which do not
have bounded second order partial derivatives (e.g., Clayton, Gumbel,
normal and Student). For instance, the first order partial derivatives of
the Clayton copula are not continuous at the corner point $(0,0)$. The
second step of our proof given in Appendix B shows that it is sufficient
to assume that

(19)  $C_u$, $C_v$ are continuous in $[0,1]^2 \setminus \{(0,0), (0,1), (1,0), (1,1)\}$.



Theorem 2. Suppose that $H$ has continuous marginal distribution
functions and that the copula $C$ has bounded second order partial
derivatives on $(0,1)^2$ and satisfies (9) and (19). If
$h_n = O(n^{-1/3})$ and (16) is satisfied, then the (kernel) copula
processes $\mathbb{C}_n^{(LLS)}$ and $\mathbb{C}_n^{(MRS)}$ converge weakly
to the Gaussian process $G_C$ in $\ell^\infty([0,1]^2)$ given in Theorem 1.

Moreover, if the functions $\Phi'$ and $\Phi'^2/\Phi$ are bounded, then
the above statement holds also for the process $\mathbb{C}_n^{(T)}$.



3. Finite sample comparisons. 



3.1. Set up and performed comparisons. In our simulation study, we always
use the Epanechnikov kernel $k(x) = \frac{3}{4}(1 - x^2) 1\{|x| \le 1\}$
and the bivariate multiplicative kernel $k(x,y) = k(x)k(y)$. The
optimality of the Epanechnikov kernel in kernel density estimation was
proven in Epanechnikov (1969). For background information on multivariate
kernels see, for example, Wand and Jones (1995) and Fan and Gijbels
(1996).

We investigate the performances of the estimators $C_n^{(E)}$,
$\hat C_n^{(T)}$, $\hat C_n^{(LL)}$, $\hat C_n^{(LLS)}$ and
$\hat C_n^{(MR)}$. We do not include the estimator $\hat C_n^{(SE)}$,
defined in (4), because this estimator is too strongly influenced by the
marginals, which makes the comparison difficult. For example, for a normal
copula with normal marginals, the estimator $\hat C_n^{(SE)}$ usually does
slightly better than its competitors. But, for a normal copula with, for
example, exponential marginals, the performance of $\hat C_n^{(SE)}$ is
considerably worse than that of its competitors. We do not present results
for the modification of the mirror-type estimator $\hat C_n^{(MRS)}$
either, since its performance was found to be close to that of the
estimator $\hat C_n^{(LLS)}$.

The performances of the various estimators were evaluated using two
criteria: a Kolmogorov-Smirnov distance $KS_n$ and a Cramér-von Mises
distance $CM_n$; that is,

$KS_n = \sup_{u,v} |\hat C_n(u,v) - C(u,v)|,$

$CM_n = \sum_{i=1}^{n} \{\hat C_n(\hat U_i, \hat V_i) - C(\hat U_i, \hat V_i)\}^2,$

where $\hat C_n$ stands for any of the investigated estimators, for
example, $\hat C_n^{(T)}$. The corresponding statistics are denoted
accordingly, for example, $KS_n^{(T)}$ and $CM_n^{(T)}$. Originally, we
included the mean integrated asymptotic error
$n \int\int [\hat C_n(u,v) - C(u,v)]^2 \, du \, dv$ as well. Not
surprisingly, this measure behaves similarly to the Cramér-von Mises
distance, since $CM_n \approx n \int\int (\hat C_n - C)^2 \, dC$, but it is
not so sensitive to the bias of the underlying copula estimator. See also
Section 4.

For computational reasons, the supremum in the Kolmogorov-Smirnov distance
$KS_n$ was replaced by a maximum over a grid of $101 \times 101$ points.
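
As a rough illustration of how the two criteria can be evaluated, the
sketch below takes an estimator and the true copula as plain Python
callables. The function names and the callable interface are our own
assumptions; the Cramér-von Mises sum is taken over whichever
pseudo-observations are supplied, and the grid size defaults to the
$101 \times 101$ grid mentioned above.

```python
def ks_distance(estimate, truth, grid_size=101):
    """Maximum absolute difference over an equispaced grid on [0, 1]^2."""
    points = [i / (grid_size - 1) for i in range(grid_size)]
    return max(abs(estimate(u, v) - truth(u, v))
               for u in points for v in points)


def cm_distance(estimate, truth, u_obs, v_obs):
    """Sum of squared differences evaluated at the pseudo-observations."""
    return sum((estimate(a, b) - truth(a, b)) ** 2
               for a, b in zip(u_obs, v_obs))


def independence_copula(u, v):
    """True copula used in the toy check below: C(u, v) = u * v."""
    return u * v


def shifted_estimate(u, v):
    """Artificial 'estimator' with a constant error of 0.01 (illustration only)."""
    return u * v + 0.01
```

With these toy inputs, `ks_distance(shifted_estimate, independence_copula)`
recovers the constant error, while `cm_distance` grows linearly with the
number of pseudo-observations, in line with the factor $n$ in
$CM_n \approx n \int\int (\hat C_n - C)^2 \, dC$.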

3.2. Bandwidth choice. The estimator $\hat C_n^{(LL)}$ involves
bandwidths $b_{1n}$ and $b_{2n}$ (for estimation of the marginals) as well
as a bandwidth $h_n$ when using local linear fitting to estimate the
copula. Preliminary simulation results confirmed the results of Chen and
Huang (2007), that the estimator $\hat C_n^{(LL)}$ [as well as its
modification $\hat C_n^{(LLS)}$] cannot be improved by smoothing the
marginals. Therefore, we simply work with the pseudo-observations
$\hat U_i^E = \frac{n}{n+1} F_n(X_i)$ and
$\hat V_i^E = \frac{n}{n+1} G_n(Y_i)$, with $F_n$ and $G_n$ the empirical
cumulative distribution functions. This slightly differs from the strategy
of strong undersmoothing recommended in Chen and Huang (2007), which more
or less results in taking $\hat U_i = \frac{2nF_n(X_i) - 1}{2n}$ and
$\hat V_i = \frac{2nG_n(Y_i) - 1}{2n}$. Nevertheless, the behavior of the
resulting estimators is very similar.

For choosing the bandwidth $h_n$ for $\hat C_n^{(LL)}$ and
$\hat C_n^{(LLS)}$, we rely on the expressions for asymptotic bias,
variance and MISE derived in Chen and Huang (2007). From the main
(asymptotic) terms in (11) and (12), we derive the asymptotic mean squared
error of the copula estimator at a given point $(u,v)$:

(20)  $\mathrm{AMSE}\{\hat C_n(u,v)\} = \mathrm{AVar}\{\hat C_n(u,v)\} + [\mathrm{ABias}\{\hat C_n(u,v)\}]^2.$

An optimal bandwidth is obtained by minimization of
$\int\int \mathrm{AMSE}\{\hat C_n(u,v)\} \, dC(u,v)$. As the true copula is
unknown, this minimization cannot be carried out. A possible approach is
then to consider a so-called reference copula. Chen and Huang (2007)
proposed using a t-copula as a reference copula. But, as the second
derivatives of the t-copula are not bounded, we experienced numerical
difficulties and instabilities trying to apply this reference rule. We
therefore decided to use Frank's copula, which has bounded second
derivatives. The unknown parameter in Frank's copula family is estimated
by inversion of Kendall's tau. The computational simplicity of this
approach also makes the goodness-of-fit testing procedures, presented in
Section 4, much more feasible.
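
The inversion of Kendall's tau for Frank's family can be sketched
numerically. The sketch relies on the standard relation
$\tau(\theta) = 1 - \frac{4}{\theta}\{1 - D_1(\theta)\}$ with the Debye
function $D_1(\theta) = \frac{1}{\theta} \int_0^\theta \frac{t}{e^t - 1} \, dt$
[see, e.g., Nelsen (2006)]; the use of Simpson's rule, bisection, the
search interval and the function names are our own choices, and only
positive dependence ($\tau > 0$) is covered.

```python
import math


def debye1(theta, steps=2000):
    """D_1(theta) = (1/theta) * int_0^theta t / (exp(t) - 1) dt, via Simpson's rule."""
    def integrand(t):
        return 1.0 if t == 0.0 else t / math.expm1(t)  # limit at t = 0 is 1

    step = theta / steps  # steps must be even for Simpson's rule
    total = integrand(0.0) + integrand(theta)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(i * step)
    return total * step / 3.0 / theta


def frank_tau(theta):
    """Kendall's tau of Frank's copula with parameter theta > 0."""
    return 1.0 - 4.0 / theta * (1.0 - debye1(theta))


def frank_theta_from_tau(tau, lo=1e-6, hi=200.0):
    """Solve frank_tau(theta) = tau by bisection (tau is increasing in theta)."""
    if not frank_tau(lo) < tau < frank_tau(hi):
        raise ValueError("tau outside the range covered by the search interval")
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if frank_tau(mid) < tau:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

In practice `tau` would be replaced by the sample version of Kendall's tau
computed from the data, and the resulting parameter plugged into the
reference copula.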

Since the shrinkage of the bandwidth in the estimator $\hat C_n^{(LLS)}$
removes the problem of possible unboundedness of the second order partial
derivatives, there are plenty of families of copulas to use as a reference
copula for this estimator. For simplicity and for more appropriate
comparisons, we also use Frank's copula reference for $\hat C_n^{(LLS)}$.

The asymptotic expansions (11) and (12) hold for the mirror-type kernel
estimators $\hat C_n^{(MR)}$ and $\hat C_n^{(MRS)}$ as well; hence we also
rely here on the same choice for $h_n$.

For the two improved estimators, a Frank copula based reference selection 
rule seems to give quite good performance (see Sections 3 and 4). A normal 
copula based reference rule tends to result in a too large bandwidth, whereas 
a Clayton copula based reference rule tends to give, on average, too small 
bandwidths. Thus Frank's reference rule seems to be a good compromise. 

More problematic is the bandwidth choice for $\hat C_n^{(T)}$, as we do
not have asymptotic expressions for bias and variance here. We tried to
minimize the expected mean squared integrated error

(21)  $\int_{-\infty}^{+\infty} \int_{-\infty}^{+\infty} [\hat H_n(x,y) - H(x,y)]^2 h(x,y) \, dx \, dy$

taking a bivariate normal distribution $H$, with corresponding density
$h$, as a reference distribution [see Jin and Shao (1999)]. However, the
resulting bandwidths turned out to be too large. A possible explanation is
that such a selection rule does not take into account that we rely on
pseudo-observations $(\hat U_i, \hat V_i)$ instead of on the unobservable
$(U_i, V_i)$. In our simulation study, we then used the above mentioned
bandwidth divided by a factor of two. This seems to be a reasonable ad hoc
solution. To further investigate the bandwidth selection problem for
$\hat C_n^{(T)}$, we calculated the ratio of the bandwidth selected via
(21) to the one selected by searching for a bandwidth that minimizes the
criterion $KS_n(h)$ [resp., $CM_n(h)$] over a grid of $h$-values. Table 1
summarizes the obtained average ratios, for various values of Kendall's
tau, from 2000 simulated samples. Note that the ratios stay quite stable
across different families of copulas as well as for different sample
sizes. This suggests that it may be possible to find a reliable
reference-based rule for $\hat C_n^{(T)}$ as well.

The simulation studies reported below showed a promising performance for
the transformation estimator $\hat C_n^{(T)}$. A good bandwidth selection
rule is missing, for the moment, and is a subject of further research.

Table 1
Average ratios of bandwidths for $\hat C_n^{(T)}$ selected from minimizing
(21) and from criteria $KS_n(h)$ and $CM_n(h)$, respectively, for
different Kendall's $\tau$ and sample sizes $n$

           Clayton                     Frank                       Normal
           τ = 0.25      τ = 0.75      τ = 0.25      τ = 0.75      τ = 0.25      τ = 0.75
           KS_n   CM_n   KS_n   CM_n   KS_n   CM_n   KS_n   CM_n   KS_n   CM_n   KS_n   CM_n
n = 50     1.20   1.28   1.53   2.00   1.23   1.38   1.53   2.04   1.28   1.37   1.26   1.62
n = 150    1.13   1.27   1.38   2.08   1.21   1.47   1.55   2.07   1.20   1.36   1.24   1.60



3.3. Simulation results. An extensive simulation study was carried out to
compare the performances of all estimators using the performance measures
$KS_n$ (the Kolmogorov-Smirnov distance) and $CM_n$ (the Cramér-von Mises
distance). To illustrate our main findings, we only report on results
obtained for the following two simulation models:

Model 1. Frank copula with Kendall's $\tau = 0.25$;

Model 2. Clayton copula with Kendall's $\tau = 0.75$.

Models 1 and 2 represent very different copula functions. The copula in
model 1 has bounded second order partial derivatives and presents a case
of mild dependence, whereas the copula in model 2 has unbounded second
order partial derivatives and shows a strong dependence between $X$ and
$Y$. From each model, we simulated 10,000 samples of sample size $n = 150$.
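
For readers who want to reproduce a sample from model 2, a minimal sketch
follows. It uses two standard facts about the Clayton family, namely
$\tau = \theta/(\theta + 2)$ and sampling by inversion of the conditional
distribution of $V$ given $U$; the function names, the seed and the sample
size in the usage lines are hypothetical, and the paper does not state
which sampling algorithm was actually used.

```python
import random


def clayton_theta_from_tau(tau):
    """Invert tau = theta / (theta + 2) for the Clayton family."""
    return 2.0 * tau / (1.0 - tau)


def sample_clayton(n, theta, rng):
    """Draw n pairs (u, v) from a Clayton copula by conditional inversion."""
    pairs = []
    for _ in range(n):
        u = 1.0 - rng.random()  # uniform on (0, 1], avoids u == 0
        w = 1.0 - rng.random()
        v = (u ** (-theta) * (w ** (-theta / (1.0 + theta)) - 1.0) + 1.0) ** (-1.0 / theta)
        pairs.append((u, v))
    return pairs


def kendall_tau(pairs):
    """Sample Kendall's tau: (concordant - discordant) / (n choose 2)."""
    n = len(pairs)
    score = 0
    for i in range(n):
        for j in range(i + 1, n):
            product = (pairs[i][0] - pairs[j][0]) * (pairs[i][1] - pairs[j][1])
            score += (product > 0) - (product < 0)
    return score / (n * (n - 1) / 2)


# Usage: one sample from model 2 (Clayton, tau = 0.75); seed chosen arbitrarily.
rng = random.Random(2009)
theta = clayton_theta_from_tau(0.75)
sample = sample_clayton(500, theta, rng)
tau_hat = kendall_tau(sample)
```

Since copulas are invariant under increasing transformations of the
margins, the uniform pairs can be used directly; any continuous margins
would give the same pseudo-observations.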

Figure 1 shows the boxplots of the performance measures $KS_n$ and $CM_n$
for model 1 (top panels) and model 2 (bottom panels). Note that, for model
1, the estimators $\hat C_n^{(T)}$, $\hat C_n^{(LL)}$, $\hat C_n^{(LLS)}$
and $\hat C_n^{(MR)}$ perform very comparably with respect to the
Cramér-von Mises distance measure. For the Kolmogorov-Smirnov performance
measure, the estimator $\hat C_n^{(T)}$ performs slightly but
significantly worse than the three other estimators. The latter is likely
caused by the usage of a too small bandwidth, as can be anticipated by
looking at the different results for the two performance measures and
Table 1. For model 2 (bottom panels), one clearly sees a better
performance of the improved estimators $\hat C_n^{(LLS)}$ and
$\hat C_n^{(T)}$, especially when looking at the performance measure
$CM_n$. This is to be expected, since the copula in model 2 has unbounded
second order partial derivatives and the measure
$CM_n \approx n \int\int (\hat C_n - C)^2 \, dC$ is most affected by points
with higher values of the copula density $c(u,v)$ (which usually
correspond to points with higher values of the second order partial
derivatives $C_{uu}$ and $C_{vv}$). In other words, the performance
measure $CM_n$ is more sensitive to the corner bias problem than the
measure $KS_n$. For model 1, there was no need for "shrinking" the
bandwidth since the copula function has bounded second order partial
derivatives. Nevertheless, the "shrinked-bandwidth" (improved) local
linear estimator also performs very well for this model. The very
promising performance of the transformation estimator $\hat C_n^{(T)}$ in
view of the Cramér-von Mises distance $CM_n$ may be partially understood
by the fact that, in view of Table 1, our ad hoc rule of bandwidth choice
for the estimator $\hat C_n^{(T)}$ is almost optimal in that situation.

Fig. 1. Boxplots of quantities $KS_n$ and $CM_n$ for different copula
estimators. Top panels: model 1; Bottom panels: model 2.

From the more extensive simulation study, we further report the following
observations. All kernel copula estimators usually improve upon the
empirical estimate $C_n^{(E)}$. For copulas with bounded second order
partial derivatives, the performances of the estimators $\hat C_n^{(LL)}$,
$\hat C_n^{(LLS)}$ and $\hat C_n^{(MR)}$ become very comparable,
especially with increasing sample size. Overall, $\hat C_n^{(MR)}$ works
slightly better for copulas with bounded second order partial derivatives
[e.g., Frank, Farlie-Gumbel-Morgenstern, Ali-Mikhail-Haq; see Nelsen
(2006)] and mild dependence, with a significant improvement for copulas
very close to the independence copula. On the other hand, the local linear
kernel estimator $\hat C_n^{(LL)}$ is preferable [compared to
$\hat C_n^{(MR)}$] in the remaining cases.

To gain further insight into the kernel estimators, we examined the
dependence of these estimators on the bandwidth. We again use models 1 and
2 to illustrate our findings. For brevity, we present results only for the
estimator $\hat C_n^{(LL)}$, since similar findings can be reported for
the other kernel estimators.

Figure 2 illustrates the performance of the copula cit^^ with a fixed 
bandwidth h, in view of the performance measures KS„ and CM„, for models 
1 and 2 (top and bottom panels, resp.). For comparison purposes, we also 
include (at the far left of the horizontal axis) the boxplot summarizing the 
results for the empirical copula Ck ■ In addition, we provide in each picture 
a (vertical) boxplot that indicates the bandwidths selected for C^^^ via 
(20). 

Note that the effect of bandwidth choice is most noticeable from the 
Kolmogorov-Smirnov quantity KS„. This is particularly true for the Clay- 
ton copula, model 2. For model 1, the estimator c2^^^ improves upon the 
empirical copula $C_n$ for almost all $h$-values in the considered range of values.
For model 2, however, which presents a case of stronger dependence, a
kernel estimator comes with a gain, but only for a carefully selected band- 
width. 

From the vertically displayed boxplots of the selected bandwidths, we can
further remark that a bandwidth selected via (20) in fact works quite
satisfactorily. This is particularly true in case of mild dependence and for copulas
with bounded second derivatives (such as model 1). It may lead to a slight 
oversmoothing in a situation of strong dependence and for copulas with un- 
bounded second derivatives (cf. model 2). It is worth mentioning though, 
that the presented results for model 2 are almost among the "worst-case" 
scenarios here. 

4. Goodness-of-fit tests for copulas. When modelling multivariate data 
using copulas, a popular method is to estimate marginals nonparametrically 
and the copula in a parametric way. This requires choosing a suitable family 
of copulas for the data at hand, which is not an easy task. In this section, 
we focus on testing the null hypothesis 

$$H_0 : C \in \mathcal{C}_0,$$

where $\mathcal{C}_0 = \{C_\theta, \theta \in \Theta\}$ is a given parametric family of copulas.
Fig. 2. Boxplots of the quantities $KS_n$ and $CM_n$ for different values of the fixed bandwidth $h$
for the kernel estimator, and boxplot (far left) for the empirical copula. Top panels: model
1; bottom panels: model 2.



Many testing methods have been proposed. See, for example, Chen and
Huang (2007) and the review paper of Genest, Rémillard and Beaudoin
(2008). The latter paper included a simulation study on classical
goodness-of-fit measures such as the Kolmogorov-Smirnov and the Cramér-von Mises
statistics, which we denote by (allowing a small abuse of previous notation)

(22) $$KS_n^{(E)} = \sup_{u,v} |C_n^{(E)}(u,v) - C_{\theta_n}(u,v)|, \qquad CM_n^{(E)} = \sum_{i=1}^n [C_n^{(E)}(\hat U_i,\hat V_i) - C_{\theta_n}(\hat U_i,\hat V_i)]^2,$$

where $\theta_n$ is an estimate of the unknown parameter $\theta_0$ based on the inversion
of the observed Kendall's tau.
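The two statistics in (22) can be computed along the following lines. This is only a minimal sketch under stated assumptions: the null family is taken to be Clayton (so that inversion of Kendall's tau reads $\theta = 2\tau/(1-\tau)$), pseudo-observations are rescaled ranks $R_i/(n+1)$ (one common convention, not necessarily the one used here), the supremum is approximated by a maximum over the pseudo-observations, and all function names are hypothetical.

```python
import numpy as np

def pseudo_observations(x):
    """Rescaled ranks R_i / (n + 1); the rescaling is an assumption of this sketch."""
    n = len(x)
    ranks = np.argsort(np.argsort(x)) + 1
    return ranks / (n + 1.0)

def kendall_tau(x, y):
    """Sample Kendall's tau from all pairs (O(n^2), assumes no ties)."""
    dx = np.sign(x[:, None] - x[None, :])
    dy = np.sign(y[:, None] - y[None, :])
    n = len(x)
    return np.sum(dx * dy) / (n * (n - 1))

def clayton_cdf(u, v, theta):
    """Clayton copula C_theta(u, v) for theta > 0."""
    return (u ** (-theta) + v ** (-theta) - 1.0) ** (-1.0 / theta)

def clayton_theta_from_tau(tau):
    """Inversion of tau = theta / (theta + 2), valid for 0 < tau < 1."""
    if not 0.0 < tau < 1.0:
        raise ValueError("Clayton inversion needs 0 < tau < 1")
    return 2.0 * tau / (1.0 - tau)

def empirical_copula(u_eval, v_eval, u, v):
    """C_n(u, v) = (1/n) sum_i 1{U_i <= u, V_i <= v} at paired evaluation points."""
    ind = (u[None, :] <= u_eval[:, None]) & (v[None, :] <= v_eval[:, None])
    return ind.mean(axis=1)

def ks_cm_statistics(x, y):
    """Return (KS, CM, theta_n) for the empirical copula against the fitted Clayton copula."""
    u, v = pseudo_observations(x), pseudo_observations(y)
    theta = clayton_theta_from_tau(kendall_tau(x, y))
    diff = empirical_copula(u, v, u, v) - clayton_cdf(u, v, theta)
    ks = np.max(np.abs(diff))   # sup over [0,1]^2 approximated at the pseudo-observations
    cm = np.sum(diff ** 2)      # sum over the n pseudo-observations
    return ks, cm, theta
```

Replacing `empirical_copula` by a kernel estimator of the copula would give the corresponding kernel-based versions of the statistics.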

The aim of this section is to investigate the size and power properties
of testing procedures based on the test statistics $KS_n^{(LL)}$, $KS_n^{(LLS)}$, $CM_n^{(LL)}$ and
$CM_n^{(LLS)}$, computed by replacing $C_n^{(E)}$ in (22) with $\hat C_n^{(LL)}$ or $\hat C_n^{(LLS)}$. In addition,
we consider here the test statistic

$$Q_n^{(E)} = \iint [C_n^{(E)}(u,v) - C_{\theta_n}(u,v)]^2 \, du \, dv$$

and its $\hat C_n^{(LL)}$ and $\hat C_n^{(LLS)}$ versions. The double integral in the definition of
$Q_n$ was approximated by a double sum over a grid of $101 \times 101$ points.
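A grid approximation of this kind might look as follows. The sketch assumes an equally spaced grid on $[0,1]^2$ including the boundary and equal weights (the exact weighting of the double sum is not specified above), and the two copulas are passed as hypothetical vectorised callables.

```python
import numpy as np

def q_statistic_grid(copula_hat, copula_null, m=101):
    """Approximate the double integral of (copula_hat - copula_null)^2 over [0,1]^2
    by an equally weighted average over an m x m grid (weights are an assumption)."""
    grid = np.linspace(0.0, 1.0, m)
    uu, vv = np.meshgrid(grid, grid, indexing="ij")
    diff2 = (copula_hat(uu, vv) - copula_null(uu, vv)) ** 2
    return diff2.mean()
```

For instance, `q_statistic_grid(lambda u, v: np.minimum(u, v), lambda u, v: u * v)` approximates the squared $L_2$ distance between the Fréchet-Hoeffding upper bound and the independence copula.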
Since the asymptotic distributions of these test statistics are too complex, 
a parametric bootstrap is used. This procedure runs as follows: 

(1) By inversion of the empirical Kendall's $\tau$, estimate the unknown parameter
$\theta$ of the null hypothesis family by $\theta_n$ and compute the test statistic
$KS_n^{(\cdot)}$ [where the superscript $(\cdot)$ refers to any of the considered estimators
of the copula];

(2) Generate $\{(U_i^*, V_i^*)\}_{i=1}^n$ from the copula $C_{\theta_n}$ and use them as original
observations to compute $\theta_n^*$ and $KS_n^{*(\cdot)}$;

(3) Repeat step (2) $B$ times;

(4) Estimate the p-value as

$$p_{KS_n^{(\cdot)}} = \frac{1 + \#\{KS_n^{*(\cdot)} \ge KS_n^{(\cdot)}\}}{B + 1}.$$

See Davison and Hinkley (1997).
For any of the other test statistics, we proceed similarly, replacing $KS_n^{(\cdot)}$ by
$CM_n^{(\cdot)}$ or $Q_n^{(\cdot)}$.
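The four steps above can be sketched generically as follows. This is an illustration only: `fit`, `sample` and `statistic` are hypothetical user-supplied callables (parameter estimation by inversion of Kendall's tau, simulation from the fitted copula, and any of the test statistics, respectively), and the bootstrap replicates are counted with "greater than or equal", as in the usual Monte Carlo p-value.

```python
import numpy as np

def bootstrap_p_value(stat_obs, stats_boot):
    """Step (4): p = (1 + #{T*_b >= T_obs}) / (B + 1)."""
    stats_boot = np.asarray(stats_boot, dtype=float)
    return (1.0 + np.sum(stats_boot >= stat_obs)) / (len(stats_boot) + 1.0)

def parametric_bootstrap(x, y, fit, sample, statistic, B=999, seed=0):
    """Parametric bootstrap p-value for a goodness-of-fit statistic.

    fit(x, y)                -> parameter estimate (hypothetical callable)
    sample(n, theta, rng)    -> (x*, y*) drawn from the fitted copula
    statistic(x, y, theta)   -> value of the test statistic
    """
    rng = np.random.default_rng(seed)
    n = len(x)
    theta_n = fit(x, y)                        # step (1): estimate the parameter
    stat_obs = statistic(x, y, theta_n)        # step (1): observed statistic
    stats_boot = np.empty(B)
    for b in range(B):                         # steps (2)-(3): B bootstrap samples
        xs, ys = sample(n, theta_n, rng)
        theta_star = fit(xs, ys)               # re-estimate on each bootstrap sample
        stats_boot[b] = statistic(xs, ys, theta_star)
    return bootstrap_p_value(stat_obs, stats_boot)   # step (4)
```

Re-estimating the parameter inside the loop is what distinguishes this scheme from simply simulating the statistic under a fixed parameter value.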

According to Genest and Rémillard (2008), the validity of this bootstrap
procedure requires the weak convergence of the copula processes $\mathbb{C}_n^{(LL)}$ and
$\mathbb{C}_n^{(LLS)}$. For the latter process, the weak convergence is justified, for all copula
families considered in our simulation study, by Theorem 2. In contrast, when
dealing with the weak convergence of the process $\mathbb{C}_n^{(LL)}$, the condition of
Theorem 1 is satisfied only for Frank's copula. Lemma C.1 of Appendix C shows
that the test based on $\hat C_n^{(LL)}$ asymptotically holds the level even for the other
families $\mathcal{C}_0$ appearing in the simulation study.

The setup of our simulation study closely follows that of Genest, Rémillard
and Beaudoin (2008). The sample size is $n = 150$, and we take $B = 999$
bootstrap samples. Three values of Kendall's tau are considered,
namely $\tau = 0.25, 0.50, 0.75$, for the following copula families: Clayton,
Gumbel, Frank, normal and Student with four degrees of freedom (df). We use
the R computing environment, version 2.5.0 [see R Development Core Team
(2007)], with the copula package [see Yan (2007)]. To approximate the level
of the test (i.e., under the null hypothesis) we use 6000 repetitions. The
estimated powers of the test statistics are based on 1500 repetitions.

The results of the simulations are presented in Tables 2, 3 and 4. For the
reader's convenience, the estimated values for the size of the test statistics are
presented in italics. Furthermore, for each testing problem, we highlighted
the "best" power performances using bold characters. Readers should be
aware that these estimated powers and sizes (using, resp., 1500
and 6000 repetitions) are of course subject to Monte Carlo approximation
errors. A conservative upper bound (relying on a binomial distribution with
parameters $B$ and $p$) for these approximation errors (in terms of standard
deviation) is 0.28% for the size estimates (using $B = 6000$ and $p = 0.05$) and
1.29% for the power estimates (using $B = 1500$ and $p = 0.5$, to obtain
an upper bound).
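These two bounds are simply the standard deviation $\sqrt{p(1-p)/B}$ of a binomial proportion, expressed in percent. A short check (the function name is hypothetical):

```python
import math

def binomial_sd_percent(p, reps):
    """Standard deviation, in percent, of a proportion estimated from `reps` Bernoulli(p) trials."""
    return 100.0 * math.sqrt(p * (1.0 - p) / reps)

size_sd = binomial_sd_percent(0.05, 6000)    # Monte Carlo error of the size estimates
power_sd = binomial_sd_percent(0.5, 1500)    # worst case (p = 0.5) for the power estimates
```

The choice $p = 0.5$ maximises $p(1-p)$, which is why it yields an upper bound for every power estimate.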

A summary of conclusions from the simulations results is as follows: 

• The use of a kernel estimator [e.g., $\hat C_n^{(LLS)}$] in goodness-of-fit testing seems
to be promising in case the true copulas are in the Clayton, Gumbel and Frank
families, and we consistently improve upon the power for the
Kolmogorov-Smirnov test;

• If a kernel estimator improves upon the power, the improvement is most noticeable when
the dependence is weaker, and is greatest for the Kolmogorov-Smirnov
test;

• The power of the test statistics $Q_n$ is usually somewhere between the
power of the Kolmogorov-Smirnov and the Cramér-von Mises test statistics;

• The power of the test statistics based on the improved estimator $\hat C_n^{(LLS)}$ is
usually higher than that of the test statistics based on $\hat C_n^{(LL)}$ for alternatives
with unbounded second order partial derivatives;

• For a true Frank copula and a Cramér-von Mises test statistic, the
estimator $\hat C_n^{(LL)}$ is usually the best choice;

• The use of kernel estimators seems to be promising for Archimedean
families of copulas (Clayton, Frank, Gumbel) but is somewhat questionable
for elliptical families of copulas (normal, Student). Although kernel
estimators may improve the power against Clayton and Gumbel alternatives,
a loss in power is noticed for Frank alternatives. This holds in particular
for $\hat C_n^{(LLS)}$-based statistics.

APPENDIX A: PROOF OF THEOREM 1 

For simplicity, we will suppress the dependence on $n$ in the notation of the
pseudo-observations $(\hat U_i, \hat V_i)$ and write, simply, $\hat U_i = F_n(X_i) = F_n(F^{-1}(U_i))$
and $\hat V_i = G_n(Y_i) = G_n(G^{-1}(V_i))$, where $(U_i, V_i)$ have a joint distribution
function given by the copula $C$.

As our proof is a straightforward adaptation of the ideas used in van
der Vaart and Wellner (2007), we would like to clarify one point. In the
following, we will encounter expectations of the form $E\,g(\hat U, \hat V)$, where $g$
is a measurable function on $[0,1]^2$ and $\hat U = F_n(F^{-1}(U))$, $\hat V = G_n(G^{-1}(V))$.

In these types of expectations, the estimators of the marginal distribution
functions $F_n$, $G_n$ are considered to be fixed (nonrandom) functions, and the
expectation is taken only with respect to $(U, V)$ with joint distribution given
by the copula $C$. Formally,

$$E\,g(\hat U, \hat V) = E_{U,V}[\,g(\hat U, \hat V) \mid (X_1, Y_1), \ldots, (X_n, Y_n)\,],$$

whenever the integral on the right-hand side exists.

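A.1. Weak convergence of the process $\mathbb{C}_n^{(LL)}$. In view of the previous
remark, we decompose

(23) $$\sqrt{n}\,(\hat C_n^{(LL)}(u,v) - C(u,v)) = A_n^h(u,v) + B_n(u,v) + C_n^h(u,v),$$

where

(24) $$A_n^h(u,v) = \frac{1}{\sqrt{n}} \sum_{i=1}^n \Big[ K_{u,h_n}\Big(\frac{u-\hat U_i}{h_n}\Big) K_{v,h_n}\Big(\frac{v-\hat V_i}{h_n}\Big) - I\{U_i \le u, V_i \le v\} - E\,K_{u,h_n}\Big(\frac{u-\hat U}{h_n}\Big) K_{v,h_n}\Big(\frac{v-\hat V}{h_n}\Big) + C(u,v) \Big],$$

(25) $$B_n(u,v) = \frac{1}{\sqrt{n}} \sum_{i=1}^n [\,I\{U_i \le u, V_i \le v\} - C(u,v)\,],$$

and

(26) $$C_n^h(u,v) = \sqrt{n}\,\Big( E\Big[ K_{u,h_n}\Big(\frac{u-\hat U}{h_n}\Big) K_{v,h_n}\Big(\frac{v-\hat V}{h_n}\Big) \Big] - C(u,v) \Big).$$

Our proof will be divided into two steps. First, we will show, in Step 1,
that $\sup_{u,v} |A_n^h(u,v)| = o_P(1)$. Then, we will prove, in Step 2, that

$$\sup_{u,v} \big| C_n^h(u,v) + C_u(u,v)\sqrt{n}\,[F_n^*(u) - u] + C_v(u,v)\sqrt{n}\,[G_n^*(v) - v] \big| = o_P(1),$$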
Table 2
Percentage of rejection of $H_0$ by various tests for samples of size $n = 150$ arising from
different copula models with $\tau = 0.25$ (entries marked "?" are illegible in the source)

Copula    True                     Cramér-von Mises                Kolmogorov-Smirnov                      MISE
under H0  copula            CM_n^(E)  CM_n^(LL) CM_n^(LLS)   KS_n^(E)  KS_n^(LL) KS_n^(LLS)    Q_n^(E)   Q_n^(LL)  Q_n^(LLS)
Clayton   Clayton                4.?        4.3        4.9        5.1        3.8        4.5        5.0        4.9        5.6
          Gumbel                ??.?       91.3       93.3       61.1       81.8       84.6       84.1       91.7       93.6
          Frank                 ?4.4       63.5       61.4       33.5       64.1       61.2       51.6       62.2       59.7
          Plackett              ??.?       61.0       63.5       34.2       57.5       57.2       53.0       61.8       62.7
          Normal                49.9       59.0       61.6       28.2       54.8       52.8       46.6       59.6       62.5
          Student, 4 df         55.9       65.5       67.0       34.6       40.8       40.7       52.1       68.0       69.5
Gumbel    Clayton               ??.?       85.2       86.7       48.3       77.5       78.1       74.0       83.3       85.6
          Gumbel                 4.?        5.1        5.6        5.0        4.9        4.7        4.5        4.9        5.4
          Frank                 14.?       18.2       20.1       11.0       18.2       22.8       18.4       15.7       17.3
          Plackett              ??.?       16.7       17.4        9.5       17.7       17.9       16.2       15.7       16.3
          Normal                ??.?       15.6       15.5        8.2       16.8       16.1       12.6       15.5       14.1
          Student, 4 df         12.8       21.5       22.6        7.3       11.3       13.3       14.3       23.3       24.5
Frank     Clayton               41.3       49.9       52.1       25.5       37.6       39.1       42.1       48.2       50.8
          Gumbel                31.8       47.5       50.1       18.1       28.5       28.3       25.4       45.7       47.5
          Frank                  4.7        4.9        4.9        5.1        4.9        4.5        4.6        4.6        5.1
          Plackett               5.7        6.9        6.2        4.7        5.0        5.8        5.4        6.7        7.0
          Normal                 8.5       13.4       12.6        9.7       12.1       11.5        7.6       14.4       12.8
          Student, 4 df         18.1       34.5       33.9       10.1       18.1       17.1       15.4       36.6       35.2
Normal    Clayton               34.3       36.9       40.1       18.3       25.6       28.7       36.2       36.0       38.9
          Gumbel                26.1       35.1       33.7       12.0       20.0       18.7       21.0       34.0       32.6
          Frank                  7.6        3.7        3.9        7.1        3.4        4.3        7.9        4.2        4.3
          Plackett               8.1        4.5        5.4        8.0        3.6        5.1        8.6        5.7        6.1
          Normal                 4.8        5.2        5.5        5.0        5.2        4.5        5.8        4.9        5.1
          Student, 4 df         11.7       14.8       14.5        6.5        9.2        9.0        9.5       17.9       19.1
Student   Clayton               29.0       29.5       34.1       22.1       33.2       36.3       32.8       26.2       30.2
          Gumbel                20.7       26.1       26.6       12.1       22.7       25.5       17.9       17.9       22.7
          Frank                  9.0        7.1        6.1        9.1        8.0        9.2        9.7        3.6        3.4
          Plackett               7.6        6.4        4.5        8.6        6.6        7.7        7.6        3.9        2.9
          Normal                 4.7        4.3        3.5        6.4        6.9        6.7        4.9        3.9        2.9
          Student, 4 df          4.7        4.7        4.9        5.3        4.8        4.7        4.4        4.7        5.4

where $F_n^*(u) = \frac{1}{n}\sum_{i=1}^n I\{U_i \le u\}$ and $G_n^*(v) = \frac{1}{n}\sum_{i=1}^n I\{V_i \le v\}$. The convergence
of the smoothed copula process $\mathbb{C}_n^{(LL)} = \sqrt{n}\,(\hat C_n^{(LL)} - C)$ to a Gaussian
process given by (17) will now follow by the results of Fermanian, Radulovic
and Wegkamp (2004).

Step 1. We consider the following class of functions from $[0,1]^2$ to $[0,1]$:

(27) $$\mathcal{F} = \Big\{ (w_1,w_2) \mapsto K_{u,h}\Big(\frac{u-\zeta_1(w_1)}{h}\Big) K_{v,h}\Big(\frac{v-\zeta_2(w_2)}{h}\Big),\ (u,v)\in[0,1]^2,\ h \in [0, \tfrac14],\ \zeta_1,\zeta_2 : [0,1]\to[0,1] \text{ nondecreasing} \Big\}.$$

Table 3
Percentage of rejection of $H_0$ by various tests for samples of size $n = 150$ arising from
different copula models with $\tau = 0.50$

Copula    True                     Cramér-von Mises                Kolmogorov-Smirnov                      MISE
under H0  copula            CM_n^(E)  CM_n^(LL) CM_n^(LLS)   KS_n^(E)  KS_n^(LL) KS_n^(LLS)    Q_n^(E)   Q_n^(LL)  Q_n^(LLS)
Clayton   Clayton                5.3        5.2        5.4        5.5        5.3        5.7        4.8        5.6        5.8
          Gumbel                99.9      100.0       99.9       98.9       99.5       99.9      100.0      100.0       99.9
          Frank                 95.9       96.4       96.2       82.5       98.3       95.6       91.1       93.3       92.3
          Plackett              95.6       96.7       96.0       75.3       94.7       89.7       92.3       95.7       94.5
          Normal                94.4       96.3       96.9       75.0       91.9       91.1       89.9       94.5       94.9
          Student, 4 df         94.9       96.7       97.2       77.9       86.4       88.2       92.7       96.2       96.3
Gumbel    Clayton               99.6       99.7       99.7       94.3       98.7       98.8       98.9       99.3       99.1
          Gumbel                 4.5        4.7        4.9        5.0        4.9        5.4        4.8        3.9        4.6
          Frank                 40.5       48.2       39.7       29.6       48.1       40.4       41.1       35.3       30.6
          Plackett              29.4       33.3       30.7       18.9       26.5       22.8       31.0       30.0       27.6
          Normal                18.8       26.4       25.1       14.6       25.3       23.8       22.3       22.1       21.8
          Student, 4 df         22.3       27.8       29.2       11.7       19.1       16.5       23.6       28.5       29.0
Frank     Clayton               89.6       88.9       91.9       68.1       72.7       75.5       85.1       86.3       85.5
          Gumbel                63.8       71.0       74.1       39.3       44.6       47.3       50.5       68.9       65.7
          Frank                  5.3        4.9        5.2        5.1        5.2        5.0        5.1        4.9        5.0
          Plackett               8.4       10.4       12.1        5.4        6.9        6.7        8.3       15.2        8.5
          Normal                19.6       26.0       29.5       17.6       26.9       25.5       16.5       25.7       17.5
          Student, 4 df         35.0       44.9       52.8       17.9       27.8       29.4       29.0       51.0       46.2
Normal    Clayton               83.0       78.3       82.9       55.8       66.6       66.1       79.6       76.1       79.4
          Gumbel                41.7       39.5       44.3       18.3       22.7       26.6       32.4       39.6       41.2
          Frank                 21.2       20.1       14.9       15.1       11.8       11.0       19.3       14.4       10.5
          Plackett              12.0        7.4        7.8        7.7        5.8        4.5       13.4       11.9        9.9
          Normal                 4.8        5.1        5.5        4.7        5.0        5.4        5.8        4.5        4.6
          Student, 4 df          8.1        6.4        8.3        4.0        4.5        4.8        7.6       11.5       12.5
Student   Clayton               80.6       78.4       83.7       62.1       74.9       76.5       79.0       72.3       75.0
          Gumbel                36.5       39.1       39.6       20.5       31.3       32.4       25.7       25.7       31.0
          Frank                 28.5       30.5       23.1       18.8       18.0       20.8       25.0       15.0       11.3
          Plackett              13.5       13.5        8.9       10.0        6.9        7.3       11.0        6.8        6.6
          Normal                 5.1        6.4        5.5        7.6        7.8        7.5        5.1        3.4        4.0
          Student, 4 df          4.7        4.9        5.1        4.9        5.0        4.9        5.2        4.1        4.7

As each function $f$ from $\mathcal{F}$ is characterized by a quintuple $(u, v, h, \zeta_1, \zeta_2)$,
the empirical process indexed by $\mathcal{F}$ can be written as

$$Z_n(f) = Z_n(u,v,h,\zeta_1,\zeta_2)$$
Table 4
Percentage of rejection of $H_0$ by various tests for samples of size $n = 150$ arising from
different copula models with $\tau = 0.75$

Copula    True                     Cramér-von Mises                Kolmogorov-Smirnov                      MISE
under H0  copula            CM_n^(E)  CM_n^(LL) CM_n^(LLS)   KS_n^(E)  KS_n^(LL) KS_n^(LLS)    Q_n^(E)   Q_n^(LL)  Q_n^(LLS)
Clayton   Clayton                5.3        5.6        5.3        5.1        4.9        4.6        3.3        4.6        4.0
          Gumbel               100.0      100.0      100.0       99.9      100.0      100.0      100.0      100.0      100.0
          Frank                 98.8       98.3       98.5       83.7       98.1       97.2       94.0       95.4       95.1
          Plackett              99.5       99.2       99.9       85.1       90.6       93.9       97.0       98.3       99.1
          Normal                99.8       99.5       99.9       90.5       93.6       96.8       98.0       98.4       99.2
          Student, 4 df         99.9       99.7      100.0       92.7       91.4       97.7       98.5       99.5       99.5
Gumbel    Clayton               99.9       99.5       99.9       95.8       98.5       98.5       99.6       99.1       99.0
          Gumbel                 4.5        4.6        4.8        4.7        4.6        4.9        5.0        3.2        3.7
          Frank                 53.3       54.5       47.0       25.6       38.1       38.3       40.7       26.6       23.7
          Plackett              24.3       23.5       19.1        6.6        9.3        9.3       29.5       24.9       25.1
          Normal                12.4       13.1       13.9       11.3       13.6       13.3       12.3        8.3        7.7
          Student, 4 df         15.6       15.7       20.1        8.5       10.4       10.8       16.2       15.1       14.9
Frank     Clayton               96.7       91.3       96.9       57.6       63.7       64.9       90.8       86.2       89.0
          Gumbel                81.6       80.5       87.8       36.9       41.7       39.7       61.1       73.7       75.3
          Frank                  4.4        4.6        4.4        4.7        4.5        4.8        4.7        3.0        3.7
          Plackett              19.6       20.7       27.5        5.9        8.7        7.5       34.5       45.7       48.3
          Normal                40.7       41.1       52.7       28.5       33.3       30.2       31.9       38.7       42.9
          Student, 4 df         58.4       57.9       72.6       26.7       33.6       30.3       50.6       63.1       64.6
Normal    Clayton               93.4       88.9       90.5       66.7       77.9       79.7       88.4       83.3       82.6
          Gumbel                41.1       38.3       43.4       13.0       19.6       20.9       24.3       33.5       35.0
          Frank                 46.1       46.7       37.7       17.5       18.2       22.9       31.0       24.1       18.4
          Plackett              15.2       11.6        9.2        3.1        3.1        4.0       24.4       27.7       23.7
          Normal                 4.7        4.8        4.4        4.4        4.6        4.7        5.2        3.5        3.5
          Student, 4 df          6.9        6.3        6.8        4.7        3.9        4.2        7.0       10.1        9.6
Student   Clayton               92.8       89.4       89.9       73.7       84.4       86.7       85.8       75.6       74.8
          Gumbel                37.3       34.5       37.0       17.3       26.3       26.9       18.4       18.4       21.6
          Frank                 52.2       51.8       45.5       24.5       28.1       33.6       30.4       20.7       14.5
          Plackett              16.4       15.7       10.3        4.3        3.9        5.8        5.8       12.7       11.6
          Normal                 4.5        4.7        3.5        6.3        6.9        7.8        2.5        2.1        1.9
          Student, 4 df          4.3        4.9        4.4        4.7        5.1        4.9        4.8        3.6        3.1

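$$= \frac{1}{\sqrt{n}}\sum_{i=1}^n K_{u,h}\Big(\frac{u-\zeta_1(U_i)}{h}\Big) K_{v,h}\Big(\frac{v-\zeta_2(V_i)}{h}\Big).$$

Put $\bar Z_n = Z_n - E Z_n$ and note that

(28) $$A_n^h(u,v) = \bar Z_n(f_1^n) - \bar Z_n(f_2),$$

where $f_1^n = (u, v, h_n, F_n(F^{-1}), G_n(G^{-1}))$ and $f_2 = (u,v,0,I,I)$,
with $I$ being the identity function on the interval $[0,1]$.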
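Lemma A.1, which is given below, states that the set of functions $\mathcal{F}$
is Donsker. Indeed, $\mathcal{F}$ is a subset of $\mathcal{F}^*$ in Lemma A.1, taking $b \equiv 1$
and $u_0 = u$ and $v_0 = v$. This implies the weak convergence of the process
$\bar Z_n(f)$, $f \in \mathcal{F}$, which further implies that the process $\bar Z_n$ is asymptotically
uniformly $\rho$-equicontinuous in probability [see pages 37-41 of van der Vaart
and Wellner (1996)] with semimetric $\rho$ given by

$$\rho(f,f') = E\Big[ K_{u,h}\Big(\frac{u-\zeta_1(U)}{h}\Big)K_{v,h}\Big(\frac{v-\zeta_2(V)}{h}\Big) - K_{u',h'}\Big(\frac{u'-\zeta_1'(U)}{h'}\Big)K_{v',h'}\Big(\frac{v'-\zeta_2'(V)}{h'}\Big) \Big]^2.$$

Using this asymptotic uniform $\rho$-equicontinuity and (28), we get that
$\sup_{u,v}|A_n^h(u,v)| = o_P(1)$, provided that $\sup_{u,v}\rho(f_1^n,f_2)$ converges to zero in
probability, where $f_1^n$ and $f_2$ are given in (28) [for details consult the proof in
van der Vaart (1994)].

Put $M = \sup_{u,h,x}|K_{u,h}(x)|$, where $K_{u,h}$ is defined in (7), and denote

$$A_\varepsilon = \{|\hat U - U| > \varepsilon \ \text{or}\ |\hat V - V| > \varepsilon\}.$$

The consistency of $F_n$ and $G_n$ yields that, for every $\varepsilon > 0$, for all sufficiently
large $n$,

$$P\Big[\max\Big\{\sup_x |F_n(x) - F(x)|,\ \sup_y |G_n(y) - G(y)|\Big\} > \varepsilon\Big] < \varepsilon,$$

which further implies that, for all sufficiently large $n$,

(29) $$P(A_\varepsilon) = P[\max\{|F_n(X) - F(X)|, |G_n(Y) - G(Y)|\} > \varepsilon] < \varepsilon.$$

Now, we can bound

$$\begin{aligned}
\rho(f_1^n,f_2) &= E\Big[ K_{u,h_n}\Big(\frac{u-\hat U}{h_n}\Big)K_{v,h_n}\Big(\frac{v-\hat V}{h_n}\Big) - I\{U\le u, V\le v\} \Big]^2 \\
&\le E\Big[ K_{u,h_n}\Big(\frac{u-\hat U}{h_n}\Big)K_{v,h_n}\Big(\frac{v-\hat V}{h_n}\Big) - I\{U\le u, V\le v\} \Big]^2 I_{A_\varepsilon^c} + I_{A_\varepsilon} \\
&\le M^4 E\big[\,|I\{\hat U \le u-h_n\} - I\{U\le u\}| + |I\{\hat U \le u+h_n\} - I\{U \le u\}| \\
&\qquad\quad + |I\{\hat V \le v-h_n\} - I\{V\le v\}| + |I\{\hat V\le v+h_n\} - I\{V\le v\}|\,\big] I_{A_\varepsilon^c} + I_{A_\varepsilon} \\
&\le 4M^4(\varepsilon + h_n) + I_{A_\varepsilon}.
\end{aligned}$$

As the above bound holds uniformly in $(u,v)$ and by (29), for all sufficiently
large $n$, we have $P(A_\varepsilon) < \varepsilon$. Since $\varepsilon$ can be arbitrarily small, this implies
that $\sup_{u,v}\rho(f_1^n,f_2)$ converges to zero in probability.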
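Lemma A.1. Suppose that the function $k$ is of bounded variation and
$\int k(x)\,dx = 1$. Then, the set of functions from $[0,1]^2$ to $[0,1]$

$$\mathcal{F}^* = \Big\{(w_1,w_2)\mapsto K_{u_0,h}\Big(\frac{u-\zeta_1(w_1)}{b(u_0)h}\Big)K_{v_0,h}\Big(\frac{v-\zeta_2(w_2)}{b(v_0)h}\Big),\ (u_0,v_0),(u,v)\in[0,1]^2,\ h\in[0,\tfrac14],\ \zeta_1,\zeta_2:[0,1]\to[0,1]\ \text{nondecreasing}\Big\},$$

where $b(w) \equiv 1$ or $b(w) = \min\{\sqrt{w},\sqrt{1-w}\}$, is Donsker.
Consequently, the family $\mathcal{F}$ in (27) is Donsker.

Proof. Note that the class of functions

$$\mathcal{G}_1 = \{(w_1,w_2)\mapsto I\{\zeta_1(w_1)\le a,\ \zeta_2(w_2)\le b\},\ a,b\in\mathbb{R},\ \zeta_1,\zeta_2:[0,1]\to[0,1]\ \text{nondecreasing}\}$$

is a subset of the class of indicators

$$\mathcal{G}_2 = \{(w_1,w_2)\mapsto I\{w_1\le(<)\,a,\ w_2\le(<)\,b\},\ a,b\in\mathbb{R}\}.$$

But this implies that $\mathcal{G}_1$ is a Donsker class [see Example 2.5.4 of van der
Vaart and Wellner (1996)].

As the set $\mathcal{G}_1$ is closed under translation, we know by the beginning of
the proof of van der Vaart (1994) that the set of functions

(30) $$\Big\{\int f(x+y)\,d\mu(y),\ f\in\mathcal{G}_1,\ \mu\in\mathcal{M}_B\Big\}$$

is a Donsker class, where $\mathcal{M}_B$ is the family of all signed measures (on $\mathbb{R}^2$) of
total mass bounded by a fixed constant $B$.
Let us introduce the set of signed measures

$$\mathcal{M}_0 = \Big\{(-\infty,w_1]\times(-\infty,w_2]\mapsto K_{u_0,h}\Big(\frac{w_1}{b(u_0)h}\Big)K_{v_0,h}\Big(\frac{w_2}{b(v_0)h}\Big),\ (u_0,v_0)\in[0,1]^2,\ h\in[0,\tfrac14]\Big\}.$$

If $k$ is of bounded variation, then, by taking sufficiently large $B$, we ensure
that $\mathcal{M}_0\subset\mathcal{M}_B$. Further, if $x$ stands for $(w_1,w_2)$ and $y$ for $(y_1,y_2)$, then,
for $f\in\mathcal{G}_1$ and $\mu\in\mathcal{M}_0$, we get

$$\int f(x+y)\,d\mu(y) = \iint I\{\zeta_1(w_1)+y_1\le u,\ \zeta_2(w_2)+y_2\le v\}\,dK_{u_0,h}\Big(\frac{y_1}{b(u_0)h}\Big)\,dK_{v_0,h}\Big(\frac{y_2}{b(v_0)h}\Big) = K_{u_0,h}\Big(\frac{u-\zeta_1(w_1)}{b(u_0)h}\Big)K_{v_0,h}\Big(\frac{v-\zeta_2(w_2)}{b(v_0)h}\Big),$$

so that $\mathcal{F}^*$ is contained in the class (30),
which is a Donsker class. As $\mathcal{F}^*$ includes $\mathcal{F}$ of (27) (consider $u_0 = u$ and
$v_0 = v$), the family $\mathcal{F}$ is a Donsker class as well. □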

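Step 2. Now, we can turn our attention to the process $C_n^h$ given by
(26). For $(u,v) \in \mathbb{R}^2$ define

$$C^*_{F_n,G_n}(u,v) = C(F(F_n^{-1}(u^*)),\ G(G_n^{-1}(v^*))),$$

where $w^* = \max\{\min\{w, 1\}, 0\}$. Note that

$$E\,I\{\hat U \le u, \hat V \le v\} = E\,I\{U \le F(F_n^{-1}(u)),\ V \le G(G_n^{-1}(v))\} + O(n^{-1}) = C^*_{F_n,G_n}(u,v) + O(n^{-1}),$$

where the remainder term $O(n^{-1})$ disappears if we do some smoothing on
the first stage; that is, if $b_{1n}, b_{2n} > 0$. As

$$\begin{aligned}
E\,K_{u,h_n}\Big(\frac{u-\hat U}{h_n}\Big)K_{v,h_n}\Big(\frac{v-\hat V}{h_n}\Big)
&= \int_{-1}^1\int_{-1}^1 E\,I\{\hat U \le u - th_n,\ \hat V \le v - sh_n\}\,k_{u,h_n}(t)\,k_{v,h_n}(s)\,dt\,ds \\
&= \int_{-1}^1\int_{-1}^1 C^*_{F_n,G_n}(u-th_n,\ v-sh_n)\,k_{u,h_n}(t)\,k_{v,h_n}(s)\,dt\,ds + O(n^{-1}),
\end{aligned}$$

it will be useful to have a closer look at the process $\{C^*_{F_n,G_n}(u,v),\ (u,v) \in [0,1]^2\}$.
In the following, we will prove that, uniformly in $(u,v)$,

(31) $$\sqrt{n}\,(C^*_{F_n,G_n}(u,v) - C(u,v)) = -C_u(u,v)\frac{1}{\sqrt{n}}\sum_{i=1}^n[\,I\{U_i\le u\}-u\,] - C_v(u,v)\frac{1}{\sqrt{n}}\sum_{i=1}^n[\,I\{V_i\le v\}-v\,] + o_P(1).$$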

Let D[a, b] be the Banach space of all cadlag functions on an interval [a, b] 
equipped with the uniform norm. 

Lemma A.2. Let $F$ be a continuous distribution function. Then, the
map $\tilde F \mapsto F \circ \tilde F^{-1}$ as a map $D[0,1] \mapsto \ell^\infty[0,1]$ is Hadamard-differentiable at
$\tilde F = F$ tangentially to the set of functions

$$\alpha \in E_F = \{\alpha(x) = \beta(F(x)),\ x \in \mathbb{R},\ \beta \in C[0,1],\ \beta(0) = \beta(1) = 0\}.$$

The derivative is given by $-\alpha \circ F^{-1}$.

Proof. Let $\alpha_t$ converge uniformly to $\alpha \in E_F$. Put $F_t = F + t\alpha_t$. As

$$\sup_{u\in[0,1]} |F_t(F_t^{-1}(u)) - u| = \sup_{u\in[0,1]} |F_t(u+) - F_t(u-)|,$$

the function $F$ is continuous and the function $\alpha_t$ converges uniformly to a
bounded and continuous function, we get that $F_t(F_t^{-1}(u)) - u = o(t)$ uniformly
in $u$.

Thus, we can calculate

$$\begin{aligned}
\frac{1}{t}\,[F(F_t^{-1}(u)) - F(F^{-1}(u)) + t\alpha(F^{-1}(u))]
&= \alpha(F^{-1}(u)) - \alpha_t(F_t^{-1}(u)) + o(1) \\
&= [\alpha(F^{-1}(u)) - \alpha(F_t^{-1}(u))] + [\alpha(F_t^{-1}(u)) - \alpha_t(F_t^{-1}(u))] + o(1).
\end{aligned}$$

As $\alpha_t \to \alpha$ uniformly, the second term converges to zero uniformly in $u$. By
using the representation $\alpha(x) = \beta(F(x))$, we see that, to ensure a uniform
convergence of the first term to zero, we need to show that $F(F_t^{-1}(u)) \to u$
uniformly. But this follows by a simple calculation, which yields

$$\begin{aligned}
|F(F_t^{-1}(u)) - F(F^{-1}(u))| &= |t\alpha_t(F_t^{-1}(u))| + o(t) \\
&\le |t|\,|\alpha_t(F_t^{-1}(u)) - \alpha(F_t^{-1}(u))| + |t|\,|\alpha(F_t^{-1}(u))| + o(t) = O(t),
\end{aligned}$$

uniformly in $u$. □

Remark. As (16) implies that $\sqrt{n}\,(F_n - F)$ converges in distribution
to a Gaussian process $B \circ F$, where $B$ is a standard Brownian bridge on
the interval $[0,1]$, the Hadamard-differentiability tangentially to $E_F$ given
in Lemma A.2 is exactly what is needed to derive the asymptotic distribution
of the process $\sqrt{n}\,(F(F_n^{-1}(u)) - u)$.

Similarly, we can prove that the mapping $\tilde G \mapsto G \circ \tilde G^{-1}$ is
Hadamard-differentiable at $\tilde G = G$ tangentially to the set of functions

$$E_G = \{\alpha(x) = \beta(G(x)),\ x \in \mathbb{R},\ \beta \in C[0,1],\ \beta(0) = \beta(1) = 0\}.$$

The proof of the following lemma follows easily by applying Lemma A. 2 
and the chain rule. 

Lemma A.3. Let the copula $C$ have continuous partial derivatives on
$[a,b] \times [c,d] \subset [0,1]^2$, and let $F$, $G$ be continuous; then, the map $(\tilde F, \tilde G) \mapsto C^*_{\tilde F,\tilde G}$,
as a map $D[a,b] \times D[a,b] \mapsto \ell^\infty([a,b] \times [c,d])$, is Hadamard-differentiable at
the point $(\tilde F,\tilde G) = (F,G)$ tangentially to the set of functions $(\alpha_1,\alpha_2) \in E_F \times E_G$.
The derivative is given by

(32) $$\phi'_{(F,G)}(\alpha_1,\alpha_2) = -C_u \circ \alpha_1 \circ F^{-1} - C_v \circ \alpha_2 \circ G^{-1} = -C_u \circ \beta_1 - C_v \circ \beta_2,$$

where $\beta_1 = \alpha_1 \circ F^{-1}$ and $\beta_2 = \alpha_2 \circ G^{-1}$.

Lemma A.3, together with Theorem 3.9.4 of van der Vaart and Wellner
(1996), implies that representation (31) holds uniformly for $(u,v) \in [a,b] \times [c,d]$.
Unfortunately, many of the most popular families (e.g., Clayton, Gumbel,
normal) do not have continuous $C_u$ and $C_v$ at some of the points
$\{(0,0), (0,1), (1,0), (1,1)\}$. The following lemma takes care of this situation.



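Lemma A.4. Let the distribution $H$ have continuous margins $F$, $G$ and a
copula function whose first derivatives are continuous on $[0,1]^2 \setminus \{(0,0), (0,1),
(1,0), (1,1)\}$. Then, representation (31) holds uniformly in $(u,v) \in [0,1]^2$.

Proof. Suppose, for simplicity, that the point of discontinuity is only
at $(0,0)$ [other points $(0,1)$, $(1,0)$, $(1,1)$ might be handled in a similar way].
Let us denote

(33) $$Z_n(u,v) = \sqrt{n}\,(C^*_{F_n,G_n}(u,v) - C(u,v)) + C_u(u,v)\frac{1}{\sqrt{n}}\sum_{i=1}^n[\,I\{U_i\le u\}-u\,] + C_v(u,v)\frac{1}{\sqrt{n}}\sum_{i=1}^n[\,I\{V_i\le v\}-v\,].$$

Let $\varepsilon > 0$ be given. As all the processes

$$X_n^1(u) = \sqrt{n}\,[F(F_n^{-1}(u)) - u], \qquad X_n^2(u) = \frac{1}{\sqrt{n}}\sum_{i=1}^n[\,I\{U_i \le u\} - u\,],$$

$$X_n^3(u) = \sqrt{n}\,[G(G_n^{-1}(u)) - u], \qquad X_n^4(u) = \frac{1}{\sqrt{n}}\sum_{i=1}^n[\,I\{V_i \le u\} - u\,]$$

converge to a Brownian bridge, we can find $\delta_\varepsilon$ and $n_\varepsilon$ such that, for all
$n \ge n_\varepsilon$,

$$P\Big(\sup_{u \in [0,\delta_\varepsilon]} |X_n^i(u)| > \frac{\varepsilon}{4}\Big) < \frac{\varepsilon}{4}, \qquad i = 1,\ldots,4.$$

As $C_u$, $C_v$ are bounded by 1, the triangular inequality implies that, for all
$n \ge n_\varepsilon$,

$$P\Big(\sup_{(u,v)\in[0,\delta_\varepsilon]^2} |Z_n(u,v)| > \varepsilon\Big) \le \sum_{i=1}^4 P\Big(\sup_{u\in[0,\delta_\varepsilon]}|X_n^i(u)| > \frac{\varepsilon}{4}\Big) < \varepsilon.$$

Next, the existence of $n'_\varepsilon$ such that, for all $n \ge n'_\varepsilon$,

$$P\Big(\sup_{(u,v)\in A_\varepsilon} |Z_n(u,v)| > \varepsilon\Big) < \varepsilon \quad \text{with } A_\varepsilon = [0,1]^2 \setminus [0,\delta_\varepsilon]^2$$

follows by Lemma A.3 applied to the rectangles $[0,1] \times [\delta_\varepsilon,1]$ and $[\delta_\varepsilon,1] \times [0,1]$.
Thus, for $n \ge \max\{n_\varepsilon, n'_\varepsilon\}$: $P(\sup_{u,v} |Z_n(u,v)| > \varepsilon) < 2\varepsilon$, which proves the
lemma. □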



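Combining (31), Lemma A.4, the fact that $h_n \to 0$ and the asymptotic
equicontinuity of the processes $\mathbb{U}_n(u) = \frac{1}{\sqrt{n}}\sum_{i=1}^n[\,I\{U_i \le u\} - u\,]$ and
$\mathbb{V}_n(v) = \frac{1}{\sqrt{n}}\sum_{i=1}^n[\,I\{V_i \le v\} - v\,]$ yields

(34) $$C_n^h(u,v) = -C_u(u,v)\,\mathbb{U}_n(u) - C_v(u,v)\,\mathbb{V}_n(v) + \sqrt{n}\,D_n(u,v) + o_P(1),$$

where the bias term $D_n$ is given by

(35) $$D_n(u,v) = \int_{-1}^1\int_{-1}^1 [\,C(u - th_n,\ v - sh_n) - C(u,v)\,]\,k_{u,h_n}(t)\,k_{v,h_n}(s)\,dt\,ds.$$

If the copula $C$ has bounded second order partial derivatives on $[0,1]^2$, then

(36) $$\sqrt{n}\,\sup_{u,v}|D_n(u,v)| = O(n^{1/2}h_n^2) = o(1).$$

Finally, combining (23), (25), (26), (34) and (36) yields

(37) $$\sqrt{n}\,(\hat C_n^{(LL)}(u,v) - C(u,v)) = \frac{1}{\sqrt{n}}\sum_{i=1}^n[\,I\{U_i\le u, V_i \le v\} - C(u,v)\,] - C_u(u,v)\frac{1}{\sqrt{n}}\sum_{i=1}^n[\,I\{U_i \le u\} - u\,] - C_v(u,v)\frac{1}{\sqrt{n}}\sum_{i=1}^n[\,I\{V_i \le v\} - v\,] + o_P(1).$$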

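A.2. Weak convergence of the process $\mathbb{C}_n^{(MR)}$. Here, we adapt the foregoing
proof for the mirrored-type kernel estimator $\hat C_n^{(MR)}$ given in (13).

Step 1. At first, we rewrite

(38) $$\hat C_n^{(MR)}(u,v) = \sum_{\ell=1}^9 [\,Z_n(\ell,u,v) - Z_n(\ell,u,0) - Z_n(\ell,0,v) + Z_n(\ell,0,0)\,],$$

where

$$Z_n(\ell,u,v) = \frac{1}{n}\sum_{i=1}^n K\Big(\frac{u-\hat U_i^{(\ell)}}{h_n}\Big)K\Big(\frac{v-\hat V_i^{(\ell)}}{h_n}\Big).$$

Let us define

$$Z_{0n}(\ell,u,v) = \frac{1}{n}\sum_{i=1}^n I\{U_i^{(\ell)} \le u,\ V_i^{(\ell)} \le v\}.$$

Similarly as in Step 1 of the proof in Appendix A.1 [weak convergence of
$\mathbb{C}_n^{(LL)}$], we can show that, for each $\ell = 1,\ldots,9$,

(39) $$\sup_{u,v} \big|\sqrt{n}\,(Z_n(\ell,u,v) - E Z_n(\ell,u,v)) - \sqrt{n}\,(Z_{0n}(\ell,u,v) - E Z_{0n}(\ell,u,v))\big| = o_P(1).$$

Further, note that

(40) $$\sum_{\ell=1}^9 Z_{0n}(\ell,u,v) = \frac{1}{n}\sum_{i=1}^n I\{U_i\le u, V_i \le v\} + F_n^*(u) + G_n^*(v) + 1.$$

Combining (38), (39) and (40) implies that, uniformly in $(u,v)$,

$$\sqrt{n}\,(\hat C_n^{(MR)}(u,v) - C(u,v)) = B_n(u,v) + C_n^h(u,v) + o_P(1),$$

where $B_n$ is given by (25) and

$$C_n^h(u,v) = \sqrt{n}\,\Big\{ \sum_{\ell=1}^9 E\Big[ K\Big(\frac{u-\hat U^{(\ell)}}{h_n}\Big)K\Big(\frac{v-\hat V^{(\ell)}}{h_n}\Big) - K\Big(\frac{u-\hat U^{(\ell)}}{h_n}\Big)K\Big(\frac{-\hat V^{(\ell)}}{h_n}\Big) - K\Big(\frac{-\hat U^{(\ell)}}{h_n}\Big)K\Big(\frac{v-\hat V^{(\ell)}}{h_n}\Big) + K\Big(\frac{-\hat U^{(\ell)}}{h_n}\Big)K\Big(\frac{-\hat V^{(\ell)}}{h_n}\Big) \Big] - C(u,v) \Big\}.$$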

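Step 2. Now, similarly as in Step 2 of the proof in Appendix A.1, we
derive that, uniformly in $(u,v)$,

(41) $$C_n^h(u,v) = -C_u(u,v)\,\mathbb{U}_n(u) - C_v(u,v)\,\mathbb{V}_n(v) + \sqrt{n}\,D_n(u,v) + o_P(1),$$

with the bias term $D_n(u,v)$ given by

(42) $$D_n(u,v) = \sum_{\ell=1}^9 E\Big[ K\Big(\frac{u-U^{(\ell)}}{h_n}\Big)K\Big(\frac{v-V^{(\ell)}}{h_n}\Big) - K\Big(\frac{u-U^{(\ell)}}{h_n}\Big)K\Big(\frac{-V^{(\ell)}}{h_n}\Big) - K\Big(\frac{-U^{(\ell)}}{h_n}\Big)K\Big(\frac{v-V^{(\ell)}}{h_n}\Big) + K\Big(\frac{-U^{(\ell)}}{h_n}\Big)K\Big(\frac{-V^{(\ell)}}{h_n}\Big) \Big] - C(u,v).$$

To show that this bias term is uniformly $O(h_n^2)$ is straightforward but
tedious. The simplest case is $(u,v) \in [h_n, 1 - h_n]^2$. Then, (42) boils down
to

$$\int_{-1}^1\int_{-1}^1 C(u - th_n,\ v - sh_n)\,k(t)\,k(s)\,dt\,ds - C(u,v)$$

and the assertion follows simply by Taylor expansion.

Regarding the remaining cases, we will be dealing explicitly only with
$(u,v) \in [1-h_n,1]^2$. The other cases may be handled in a similar way.

Note that Taylor expansion together with the assumptions of the theorem
imply $C(u,v) = u + v - 1 + O(h_n^2)$ uniformly in $(u,v) \in [1 - 2h_n, 1]^2$. Further,
routine algebra shows that (42) simplifies to

(43) $$D_n(u,v) = E\,K\Big(\frac{u-U}{h_n}\Big)K\Big(\frac{v-V}{h_n}\Big) + E\,K\Big(\frac{u+U-2}{h_n}\Big)K\Big(\frac{v-V}{h_n}\Big) + E\,K\Big(\frac{u-U}{h_n}\Big)K\Big(\frac{v+V-2}{h_n}\Big) + E\,K\Big(\frac{u+U-2}{h_n}\Big)K\Big(\frac{v+V-2}{h_n}\Big) - C(u,v).$$

Let us compute

(44) $$\begin{aligned}
E\,K\Big(\frac{u-U}{h_n}\Big)K\Big(\frac{v-V}{h_n}\Big)
&= \int_{(u-1)/h_n}^{1}\int_{(v-1)/h_n}^{1} C(u - th_n,\ v - sh_n)\,k(t)k(s)\,ds\,dt
 + \int_{(u-1)/h_n}^{1}\int_{-1}^{(v-1)/h_n} (u-th_n)\,k(t)k(s)\,ds\,dt \\
&\quad + \int_{-1}^{(u-1)/h_n}\int_{(v-1)/h_n}^{1} (v - sh_n)\,k(t)k(s)\,ds\,dt
 + \int_{-1}^{(u-1)/h_n}\int_{-1}^{(v-1)/h_n} 1\cdot k(t)k(s)\,ds\,dt \\
&= \int_{(u-1)/h_n}^{1}\int_{(v-1)/h_n}^{1} (u - th_n + v - sh_n - 1)\,k(t)k(s)\,ds\,dt
 + \int_{(u-1)/h_n}^{1} (u-th_n)\,k(t)\,dt\; K\Big(\frac{v-1}{h_n}\Big) \\
&\quad + \int_{(v-1)/h_n}^{1} (v - sh_n)\,k(s)\,ds\; K\Big(\frac{u-1}{h_n}\Big)
 + K\Big(\frac{u-1}{h_n}\Big)K\Big(\frac{v-1}{h_n}\Big) + O(h_n^2) \\
&= u + v - 1 + (1-u)K\Big(\frac{u-1}{h_n}\Big) + (1-v)K\Big(\frac{v-1}{h_n}\Big)
 - h_n\int_{(u-1)/h_n}^{1} t\,k(t)\,dt - h_n\int_{(v-1)/h_n}^{1} t\,k(t)\,dt + O(h_n^2).
\end{aligned}$$

Similarly,

(45) $$\begin{aligned}
E\,K\Big(\frac{u+U-2}{h_n}\Big)K\Big(\frac{v-V}{h_n}\Big)
&= \int_{-1}^{(u-1)/h_n}\int_{(v-1)/h_n}^{1} P(U \ge 2 + th_n - u,\ V \le v - sh_n)\,k(t)k(s)\,ds\,dt \\
&\quad + \int_{-1}^{(u-1)/h_n}\int_{-1}^{(v-1)/h_n} (1 - 2 - th_n + u)\,k(t)k(s)\,ds\,dt \\
&= \cdots = (u-1)K\Big(\frac{u-1}{h_n}\Big) - h_n\int_{-1}^{(u-1)/h_n} t\,k(t)\,dt + O(h_n^2),
\end{aligned}$$

(46) $$E\,K\Big(\frac{u-U}{h_n}\Big)K\Big(\frac{v+V-2}{h_n}\Big) = (v-1)K\Big(\frac{v-1}{h_n}\Big) - h_n\int_{-1}^{(v-1)/h_n} t\,k(t)\,dt + O(h_n^2),$$

(47) $$E\,K\Big(\frac{u+U-2}{h_n}\Big)K\Big(\frac{v+V-2}{h_n}\Big) = O(h_n^2).$$

Combining (43) with (44), (45), (46) and (47) gives us

$$D_n(u, v) = u + v - 1 + O(h_n^2) - C(u, v) = O(h_n^2),$$

which was to be proved.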
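For the reader's convenience, the cancellation behind this last step can be spelled out. The following is only a sketch: it writes $a = (u-1)/h_n$ and $b = (v-1)/h_n$, takes the four expansions of the corner case $(u,v) \in [1-h_n,1]^2$ as they result from a direct computation, and assumes (as is usual, but it is an assumption here) that the kernel $k$ is symmetric, so that $\int_{-1}^1 t\,k(t)\,dt = 0$.

```latex
\begin{aligned}
(44)&:\quad u+v-1 + (1-u)K(a) + (1-v)K(b)
      - h_n\int_a^1 t\,k(t)\,dt - h_n\int_b^1 t\,k(t)\,dt + O(h_n^2),\\
(45)&:\quad (u-1)K(a) - h_n\int_{-1}^{a} t\,k(t)\,dt + O(h_n^2),\\
(46)&:\quad (v-1)K(b) - h_n\int_{-1}^{b} t\,k(t)\,dt + O(h_n^2),\\
(47)&:\quad O(h_n^2),\\[4pt]
\text{sum}&:\quad u+v-1 - 2h_n\int_{-1}^{1} t\,k(t)\,dt + O(h_n^2)
      \;=\; u+v-1+O(h_n^2).
\end{aligned}
```

The terms in $K(a)$ and $K(b)$ cancel exactly between (44) and (45), (46), and the first-order terms in $h_n$ recombine into full integrals of $t\,k(t)$, which vanish under the symmetry assumption.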



APPENDIX B: PROOF OF THEOREM 2

B.1. Weak convergence of the processes $\mathbb{C}_n^{(LLS)}$ and $\mathbb{C}_n^{(MRS)}$. The proof
of Theorem 2 for these estimators goes completely along the lines of the proof
of Theorem 1, apart from a small difference in Step 2.






M. OMELKA, I. GIJBELS AND N. VERAVERBEKE 



This difference is in calculating the bias term $D_n$ given by (35) for $\hat C_n^{(LLS)}$
and by (42) for $\hat C_n^{(MRS)}$. The shrinking of the bandwidths by the function
$b(w) = \min\{\sqrt{w}, \sqrt{1-w}\}$, together with condition (9), guarantees that the
Taylor expansion

$$C(u - sh_nb(u),\ v - th_nb(v)) = C(u,v) - sh_nb(u)\,C_u(u,v) - th_nb(v)\,C_v(u,v) + O(h_n^2)$$

holds uniformly in $(u,v) \in [0,1]^2$ and $(s,t) \in [-1,1]^2$. Applying the above
expansion in the bias calculations completes the proof.
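As a small illustration of the shrinkage factor used here, the following sketch evaluates $b(w)$ and the resulting effective bandwidth $h\,b(w)$. The function names are hypothetical; only the formula for $b$ is taken from the text.

```python
import numpy as np

def b(w):
    """Shrinkage factor b(w) = min(sqrt(w), sqrt(1 - w)) for w in [0, 1]."""
    return np.minimum(np.sqrt(w), np.sqrt(1.0 - w))

def shrunk_bandwidth(w, h):
    """Effective bandwidth h * b(w): zero at w = 0 and w = 1, largest at w = 1/2."""
    return h * b(w)
```

The effective bandwidth therefore vanishes at the endpoints of $[0,1]$, which is where the second order partial derivatives of many copulas are unbounded.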

B.2. Weak convergence of the process $\mathbb{C}_n^{(T)}$. The proof is completely
analogous to (and simpler than) the proof of Theorem 1 for $\mathbb{C}_n^{(LL)}$. The only
difference is in calculating the bias $D_n$, which is for the estimator $\hat C_n^{(T)}$ given
by

$$D_n(u,v) = \iint C(\Phi(\Phi^{-1}(u) - sh_n),\ \Phi(\Phi^{-1}(v) - th_n))\,k(s)\,k(t)\,ds\,dt - C(u,v).$$

As all the second order partial derivatives of $C(\Phi(\Phi^{-1}(u) - sh_n),\ \Phi(\Phi^{-1}(v) - th_n))$,
taken as a function of $(s,t)$, are bounded by the assumptions of the
theorem, Taylor expansion gives us $D_n(u,v) = O(h_n^2)$ uniformly in $(u,v)$,
which proves the statement.

APPENDIX C: JUSTIFICATION OF BOOTSTRAP TESTS BASED ON $\hat C_n^{(LL)}$

Denote the process underlying the goodness-of-fit statistics as
$\mathbb{G}_n = \sqrt{n}\,(\hat C_n^{(LL)} - C_{\theta_n})$ and $\mathbb{G}_n^* = \sqrt{n}\,(\hat C_n^{(LL)*} - C_{\theta_n^*})$ for its bootstrap version. In the
following lemma, we will suppose that the true copula $C$ belongs to a known
parametric family of copulas $\mathcal{C}_0 = \{C_\theta, \theta \in \Theta\}$.

Lemma C.1. Assume that the parametric family of copulas $\mathcal{C}_0$ satisfies
the assumptions of Theorem 1 of Genest and Rémillard (2008) and, moreover,
the derivative of the true copula $C_{\theta_0}$ with respect to $\theta$ is continuous as
a function of $(u,v)$ in $[0,1]^2$. Then, there exists a "nonstochastic" sequence
of functions $a_n$ from $[0,1]^2$ to $[0,1]$, such that $(\mathbb{G}_n - a_n,\ \mathbb{G}_n^* - a_n)$ converges
in distribution to two independent copies of the same process.

Proof. From the proof of Theorem 1 of this paper, it follows that

$$\sqrt{n}\,(\hat C_n^{(LL)} - C_\theta) = \sqrt{n}\,(C_n^{(E)} - C_\theta) + \sqrt{n}\,D_n^{\theta} + o_P(1),$$

where

(48) $$D_n^{\theta}(u,v) = \int_{-1}^1\int_{-1}^1 [\,C_\theta(u-th_n,\ v - sh_n) - C_\theta(u,v)\,]\,k_{u,h_n}(t)\,k_{v,h_n}(s)\,dt\,ds.$$

Thus, the processes $\mathbb{G}_n$ and $\mathbb{G}_n^*$ might be rewritten as

(49) $$\mathbb{G}_n = \sqrt{n}\,(C_n^{(E)} - C_{\theta_n}) + \sqrt{n}\,D_n^{\theta_0} + o_P(1),$$

(50) $$\mathbb{G}_n^* = \sqrt{n}\,(C_n^{(E)*} - C_{\theta_n^*}) + \sqrt{n}\,D_n^{\theta_n} + o_P(1).$$

As the empirical copula process $\sqrt{n}\,(C_n^{(E)} - C_{\theta_0})$ converges weakly, Theorem
1 of Genest and Rémillard (2008) implies that the first terms on the
right-hand sides of (49) and (50) converge jointly in distribution to independent
copies of the same process. Thus, defining $a_n(u,v)$ as $\sqrt{n}\,D_n^{\theta_0}(u,v)$, it remains
to show that

$$\sup_{u,v} \sqrt{n}\,|D_n^{\theta_n}(u,v) - D_n^{\theta_0}(u,v)| = o_P(1).$$

But this follows directly from (48), the first order Taylor expansion of $C_\theta$
around the true value of the parameter and the assumptions of the lemma. □

APPENDIX D: VERIFICATION OF (9) FOR SOME FAMILIES OF COPULAS

We will verify assumption (9) only for $C_{uu}$. The assumptions about $C_{uv}$,
$C_{vv}$ may be checked analogously.

Clayton and Gumbel copulas. Clayton and Gumbel copulas belong to
an Archimedean family of copulas given by

(51) $$C(u,v) = \phi^{-1}(\phi(u) + \phi(v)),$$

where the function $\phi$ is called a generator of the copula. The generator of
a Clayton copula is given by $\phi(t) = \frac{1}{\theta}(t^{-\theta} - 1)$ with $\theta > 0$ and that of a
Gumbel copula by $\phi(t) = (-\log t)^\theta$ with $\theta \ge 1$.
Direct differentiation of (51) yields

(52) $$C_u(u,v) = \frac{\phi'(u)}{\phi'(C(u,v))}, \qquad C_{uu}(u,v) = \frac{\phi''(u)}{\phi'(C(u,v))} - \frac{[\phi'(u)]^2\,\phi''(C(u,v))}{[\phi'(C(u,v))]^3}.$$
[<P'{C{u,v))f 
For a Clayton and a Gumbel copula, it is easy to verify that $\frac{\phi''(u)}{\phi'(u)} = O\left(\frac{1}{u(1-u)}\right)$.
Hence, we can bound the first term on the right-hand side of the expression
for $C_{uu}(u,v)$ in (52) uniformly in $v$ by

(53) $\left|\frac{\phi''(u)}{\phi'(C(u,v))}\right| \leq \left|\frac{\phi''(u)}{\phi'(u)}\right| \left|\frac{\phi'(u)}{\phi'(C(u,v))}\right| \leq \left|\frac{\phi''(u)}{\phi'(u)}\right| |C_u(u,v)| = O\left(\frac{1}{u(1-u)}\right).$

The second term on the right-hand side of the expression for $C_{uu}(u,v)$ in
(52) is a more delicate one. For a Clayton copula, we have $\phi'(t) = -t^{-\theta-1}$
and $\phi''(t) = (\theta+1)\, t^{-\theta-2}$, which implies

(54) $\left|\frac{[\phi'(u)]^2\, \phi''(C(u,v))}{[\phi'(C(u,v))]^3}\right| = \frac{(\theta+1)\,[C(u,v)]^{3\theta+3}}{u^{2\theta+2}\,[C(u,v)]^{\theta+2}} = \frac{(\theta+1)\,[C(u,v)]^{2\theta+1}}{u^{2\theta+2}} \leq \frac{\theta+1}{u} = O\left(\frac{1}{u(1-u)}\right),$

using the Frechet-Hoeffding upper bound for a copula [see Nelsen (2006)].
Combining (53) and (54) verifies (9) for $C_{uu}$ of a Clayton copula.
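
As a numerical sanity check (not part of the proof), the Clayton case can be sketched in Python. The function names, the grid and the test point below are illustrative assumptions. The sketch evaluates $C_{uu}$ through the generator formula (52), compares it with a central second difference of the copula, and reports the largest value of $u(1-u)|C_{uu}(u,v)|$ on a grid that approaches the boundary.

```python
import math

def clayton_copula(u, v, theta):
    """Clayton copula generated by phi(t) = (t**-theta - 1) / theta, theta > 0."""
    return (u ** -theta + v ** -theta - 1.0) ** (-1.0 / theta)

def clayton_cuu(u, v, theta):
    """C_uu via (52), with phi'(t) = -t**(-theta-1), phi''(t) = (theta+1) t**(-theta-2)."""
    c = clayton_copula(u, v, theta)
    d1 = lambda t: -(t ** (-theta - 1.0))
    d2 = lambda t: (theta + 1.0) * t ** (-theta - 2.0)
    return d2(u) / d1(c) - d1(u) ** 2 * d2(c) / d1(c) ** 3

def central_second_difference(u, v, theta, h=1e-4):
    """Finite-difference approximation of C_uu, used only as an independent check."""
    f = lambda x: clayton_copula(x, v, theta)
    return (f(u + h) - 2.0 * f(u) + f(u - h)) / h ** 2

def scaled_sup(theta, grid):
    """Largest u(1-u)|C_uu(u,v)| over the grid; (9) requires this to stay bounded."""
    return max(u * (1.0 - u) * abs(clayton_cuu(u, v, theta))
               for u in grid for v in grid)

# Hypothetical grid, chosen to include points close to 0 and 1.
GRID = [1e-6, 1e-4, 1e-2, 0.1, 0.3, 0.5, 0.7, 0.9, 0.99, 1 - 1e-4, 1 - 1e-6]

print(clayton_cuu(0.3, 0.6, 2.0), central_second_difference(0.3, 0.6, 2.0))
print(scaled_sup(2.0, GRID))
```

A finite grid can only illustrate the bound; the uniform statement is the one given by (53) and (54).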
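For a Gumbel copula we have

$\phi'(u) = -\theta(-\log u)^{\theta-1}\, \frac{1}{u},$

$\phi''(u) = \theta(\theta-1)(-\log u)^{\theta-2}\, \frac{1}{u^2} + \theta(-\log u)^{\theta-1}\, \frac{1}{u^2},$

which implies

(55) $\frac{[\phi'(u)]^2\, \phi''(C(u,v))}{[\phi'(C(u,v))]^3} = \frac{\theta(\theta-1)(-\log C(u,v))^{\theta-2} + \theta(-\log C(u,v))^{\theta-1}}{[C(u,v)]^2} \times \frac{\theta^2(-\log u)^{2\theta-2}}{u^2} \times \left(-\frac{\theta(-\log C(u,v))^{\theta-1}}{C(u,v)}\right)^{-3}$

$= -\frac{(-\log u)^{2\theta-2}\, C(u,v)}{u^2} \times \left[(\theta-1)(-\log C(u,v))^{1-2\theta} + (-\log C(u,v))^{2-2\theta}\right].$
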
When $u \to 0_+$, the key fact is that

(56) $\frac{(-\log u)^{2\theta-2}\, C(u,v)}{u^2\, (-\log C(u,v))^{2\theta-2}} \leq \frac{(-\log u)^{2\theta-2}\, u}{u^2\, (-\log u)^{2\theta-2}} = \frac{1}{u},$

and, when $u \to 1_-$,

(57) $\frac{(-\log u)^{2\theta-2}}{(-\log C(u,v))^{2\theta-1}} \leq \frac{(-\log u)^{2\theta-2}}{(-\log u)^{2\theta-1}} = \frac{1}{-\log u} = O\left(\frac{1}{1-u}\right).$

Combining (53), (55), (56) and (57) verifies (9) for $C_{uu}$ of a Gumbel copula.
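
The two key inequalities for the Gumbel copula rest only on $C(u,v) \leq u$, and they can be spot-checked numerically. The following Python sketch is an illustration under stated assumptions: the helper names and the grid are hypothetical, the inequalities are tested up to a small relative tolerance to absorb rounding, and $C_{uu}$ is evaluated from the generator formula (52).

```python
import math

def gumbel_copula(u, v, theta):
    """Gumbel copula generated by phi(t) = (-log t)**theta, theta >= 1."""
    a = (-math.log(u)) ** theta + (-math.log(v)) ** theta
    return math.exp(-a ** (1.0 / theta))

def gumbel_cuu(u, v, theta):
    """C_uu via (52), using the first two derivatives of the Gumbel generator."""
    c = gumbel_copula(u, v, theta)

    def d1(t):
        return -theta * (-math.log(t)) ** (theta - 1.0) / t

    def d2(t):
        log_t = -math.log(t)
        return theta * ((theta - 1.0) * log_t ** (theta - 2.0)
                        + log_t ** (theta - 1.0)) / t ** 2

    return d2(u) / d1(c) - d1(u) ** 2 * d2(c) / d1(c) ** 3

def key_bounds_hold(u, v, theta, rel_tol=1e-9):
    """Check the inequality used as u -> 0+ and the one used as u -> 1-."""
    c = gumbel_copula(u, v, theta)
    lu, lc = -math.log(u), -math.log(c)
    p = 2.0 * theta - 2.0
    near_zero = lu ** p * c / (u ** 2 * lc ** p) <= (1.0 / u) * (1.0 + rel_tol)
    near_one = lu ** p / lc ** (p + 1.0) <= (1.0 / lu) * (1.0 + rel_tol)
    return near_zero and near_one

def scaled_sup(theta, grid):
    """Largest u(1-u)|C_uu(u,v)| over the grid; (9) requires this to stay bounded."""
    return max(u * (1.0 - u) * abs(gumbel_cuu(u, v, theta))
               for u in grid for v in grid)

# Hypothetical grid, chosen to include points close to 0 and 1.
GRID = [1e-6, 1e-4, 1e-2, 0.1, 0.3, 0.5, 0.7, 0.9, 0.99, 1 - 1e-4, 1 - 1e-6]

print(all(key_bounds_hold(u, v, 1.5) for u in GRID for v in GRID))
print(scaled_sup(1.5, GRID))
```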

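Normal copula. The normal copula is given by

$C(u,v) = \int_{-\infty}^{\Phi^{-1}(u)} \int_{-\infty}^{\Phi^{-1}(v)} \frac{1}{2\pi\sqrt{1-\rho^2}} \exp\left\{-\frac{s^2 - 2\rho st + t^2}{2(1-\rho^2)}\right\} ds\, dt, \qquad \rho \in (-1,1),$

where $\Phi$ is the cumulative distribution function of a standard normal variable.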

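By a direct computation (or with the help of properties of a conditional
normal distribution), we get

$C_u(u,v) = \Phi\left(\frac{\Phi^{-1}(v) - \rho\,\Phi^{-1}(u)}{\sqrt{1-\rho^2}}\right),$

whose derivative with respect to $u$ is given by

(58) $C_{uu}(u,v) = \phi\left(\frac{\Phi^{-1}(v) - \rho\,\Phi^{-1}(u)}{\sqrt{1-\rho^2}}\right) \frac{-\rho}{\sqrt{1-\rho^2}}\, \frac{1}{\phi(\Phi^{-1}(u))},$

where $\phi = \Phi'$. As $\phi$ is bounded, it is sufficient to deal with $[\phi(\Phi^{-1}(u))]^{-1}$.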
L'Hopital's rule yields

$\frac{u(1-u)}{\phi(\Phi^{-1}(u))} \sim \frac{1-2u}{-\Phi^{-1}(u)} = o(1) \quad \text{for } u \to 0_+ \ (u \to 1_-),$

which, together with (58), verifies (9) for $C_{uu}$ of a normal copula.
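
The limit obtained from L'Hopital's rule can be illustrated numerically. The sketch below uses only `statistics.NormalDist` from the Python standard library; the function names are hypothetical. It evaluates $u(1-u)/\phi(\Phi^{-1}(u))$ together with the ratio of derivatives $(1-2u)/(-\Phi^{-1}(u))$, where the latter uses the identity $\phi'(x) = -x\phi(x)$.

```python
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal: mean 0, standard deviation 1

def scaled_reciprocal_density(u):
    """u(1-u) / phi(Phi^{-1}(u)), the quantity whose limit is taken in the text."""
    z = STD_NORMAL.inv_cdf(u)
    return u * (1.0 - u) / STD_NORMAL.pdf(z)

def lhopital_comparator(u):
    """(1-2u) / (-Phi^{-1}(u)), the ratio of derivatives from L'Hopital's rule."""
    return (1.0 - 2.0 * u) / (-STD_NORMAL.inv_cdf(u))

# Both columns shrink slowly (roughly like 1/|Phi^{-1}(u)|) as u approaches 0.
for u in (1e-2, 1e-4, 1e-6, 1e-8):
    print(u, scaled_reciprocal_density(u), lhopital_comparator(u))
```

The decay is only logarithmic in $u$, so the values remain visibly positive even at $u = 10^{-8}$; this is consistent with the limit being zero but shows why a numerical check alone would not be convincing.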

Student copula. The Student copula (with $m$ degrees of freedom) is given by

$C(u,v) = \int_{-\infty}^{t_m^{-1}(u)} \int_{-\infty}^{t_m^{-1}(v)} \frac{1}{2\pi\sqrt{1-\rho^2}} \left(1 + \frac{s^2 - 2\rho st + t^2}{m(1-\rho^2)}\right)^{-(m+2)/2} ds\, dt, \qquad \rho \in (-1,1),$

where $t_m^{-1}(\cdot)$ is the quantile function of the Student distribution with $m$
degrees of freedom.
Direct calculation shows that

(59) $C_u(u,v) = \frac{d^{(m+2)/2}}{f_m(t_m^{-1}(u))\,(c+d)^{(m+1)/2}}\, \frac{1}{2\pi\sqrt{1-\rho^2}} \int_{-\infty}^{(t_m^{-1}(v) - \rho\, t_m^{-1}(u))/\sqrt{c+d}} (1+x^2)^{-(m+2)/2}\, dx,$

where $c = [t_m^{-1}(u)]^2$, $d = m(1-\rho^2)$ and $f_m$ is the density of the Student
distribution with $m$ degrees of freedom. Assumption (9) for $C_{uu}$ of a Student
copula can be verified by differentiating (59) with respect to $u$. The useful
facts (which follow by l'Hopital's rule or properties of the density $f_m$) are



where $f_m'$ is the derivative of $f_m$.